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Intellectual Property Rights 

IPRs essential or potentially essential to the present document may have been declared to ETSI. The information 
pertaining to these essential IPRs, if any, is publicly available for ETSI members and non-members, and can be found 
in ETSI SR 000 314: "Intellectual Property Rights (IPRs); Essential, or potentially Essential, IPRs notified to ETSI in 
respect of ETSI standards", which is available from the ETSI Secretariat. Latest updates are available on the ETSI Web 
server ( http://ipr.etsi.org ). 

Pursuant to the ETSI IPR Policy, no investigation, including IPR searches, has been carried out by ETSI. No guarantee 
can be given as to the existence of other IPRs not referenced in ETSI SR 000 314 (or the updates on the ETSI Web 
server) which are, or may be, or may become, essential to the present document. 



Foreword 

This Technical Specification (TS) has been produced by Joint Technical Committee (JTC) Broadcast of the European 
Broadcasting Union (EBU), Comite Europeen de Normalisation ELECtrotechnique (CENELEC) and the European 
Telecommunications Standards Institute (ETSI). 

NOTE: The EBU/ETSI JTC Broadcast was established in 1990 to co-ordinate the drafting of standards in the 
specific field of broadcasting and related fields. Since 1995 the JTC Broadcast became a tripartite body 
by including in the Memorandum of Understanding also CENELEC, which is responsible for the 
standardization of radio and television receivers. The EBU is a professional association of broadcasting 
organizations whose work includes the co-ordination of its members' activities in the technical, legal, 
programme-making and programme-exchange domains. The EBU has active members in about 60 
countries in the European broadcasting area; its headquarters is in Geneva. 

European Broadcasting Union 
CH-1218 GRAND SACONNEX (Geneva) 
Switzerland 

Tel: +41 22 717 21 11 
Fax: +41 22 717 24 81 
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1 Scope 

The present document describes the key components of the DTS Coherent Acoustics technology. This edition has been 
enhanced to include additional extensions that can be applied either directly to the core, or carried in an extension 
substream. The extension substream may also be used without the core substream to create new coding profiles. 

The original DTS Coherent Acoustics coding system is a modular format designed to permit forward looking 
enhancements without causing the obsolescence of a baseline "core" decoder. The core decoder supports up to 
5.1 channels of audio (5 primary channels with 1 Low Frequency Effects (LFE) channel, at up to a 48 kHz sampling 
frequency. Through the use of extensions, the core can be enhanced with: 

• Additional discrete channels. 

• Additional matrixed channels. 

• Higher audio sampling frequencies. 

The addition of an extension substream provides extensibility to the original coding system by allowing additional 
channels to supplement the core, resolution enhancement of the core through residual coding extensions and frequency 
enhancements. An additional benefit of the extension substream is to enable new coding profiles that do not use the 
core, such as a low bitrate coding mode and an efficient lossless coding mode. When the extension substream is used in 
conjunction with the core, it is a superset of the original DTS Coherent Acoustics coding system and allows for a simple 
legacy compatibility by addressing the core substeam only New coding profiles, such as the use of lossless coding 
without a core and the low bit rate coding profile can maintain compatibility through the use of a low complexity 
transcoder. The main feature enhancements inherent to the extension substream are as follows: 

• More channels and greater flexibility in how they are applied, including replacement channels and dubbing 
tracks. 

• A rich metadata set to permit the combining of audio elements under both program and user control. 

• Enhanced metadata specifically for broadcast applications. 

• The ability to carry multiple embedded downmixes. 



2 References 

References are either specific (identified by date of publication and/or edition number or version number) or 
non-specific. For specific references, only the cited version applies. For non-specific references, the latest version of the 
reference document (including any amendments) applies. 

Referenced documents which are not found to be publicly available in the expected location might be found at 
http ://docbox. etsi.org/Reference . 

NOTE: While any hyperlinks included in this clause were valid at the time of publication ETSI cannot guarantee 
their long term validity. 

2.1 Normative references 

The following referenced documents are necessary for the application of the present document. 

[1] ETSI EN 300 468: "Digital Video Broadcasting (DVB); Specification for Service Information (SI) 

in DVB systems". 

[2] ETSI TS 101 154: "Digital Video Broadcasting (DVB); Specification for the use of Video and 

Audio Coding in Broadcasting Applications based on the MPEG-2 Transport Stream". 
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2.2 Informative references 

The following referenced documents are not necessary for the application of the present document but they assist the 
user with regard to a particular subject area. 

[i.l] ISO/IEC 14496-12: "Information technology - Coding of audio-visual objects - Part 12: ISO Base 

Media File Format". 

NOTE: Available at International Standards Organization, www.iso.ch ; International Electrotechnical 
Commission, www.iec.ch. 

[i.2] ISO/IEC 13818-1: "Information Technology - Generic coding of moving pictures and associated 

audio information: Systems". 

NOTE: Available at International Standards Organization, www.iso.ch ; International Electrotechnical 
Commission, www.iec.ch. 

[i.3] ISO/IEC 14496-14: "Information technology - Coding of audio-visual objects - Part 14: MP4 file 

format" . 

NOTE: Available at International Standards Organization, www.iso.ch ; International Electrotechnical 
Commission, www.iec.ch. 

[i.4] ISO/IEC 14496-1: "Information technology - Coding of audio-visual objects - Part 1: Systems". 

NOTE: Available at International Standards Organization, www.iso.ch ; International Electrotechnical 
Commission, www.iec.ch. 

[i.5] ISO 639-2:1998: "Codes for the representation of names of languages - Part 2: Alpha-3 code". 

NOTE: Available at International Standards Organization, www.ciso.ch ; International Electrotechnical 
Commission, www.iec.ch. 

[i.6] ISO/IEC 8859-1 (1998): "Information technology - 8-bit single-byte coded graphic character sets - 

Part 1: Latin alphabet No. 1". 

NOTE: Available at International Standards Organization, www.iso.ch ; International Electrotechnical 
Commission, www.iec.ch. 

[i.7] DTS Document #9302J81 100: "DTS-HD® PBR API Library Description". 

NOTE: Available from DTS, Inc., www.dts.com . 

3 Definitions, abbreviations and document conventions 
3.1 Definitions 

For the purposes of the present document, the following terms and definitions apply: 

audio frame: complete logical access unit of an audio stream that corresponds to a defined number of decodable PCM 
audio samples for a given time segment of the audio presentation 

audio stream: sequence of synchronized audio frames 

bit(n): pseudo type where the parameter n represents consecutive bits in the stream 

NOTE: Padding is never assumed where this is used. All stream parameters described using bit() are unsigned 
and MSB first aligned in the stream. 

ByteAlign(): pseudo function to pad to the end of the current byte with from to 7 bits 

boolean: value which resolves to either a logical 1 or 
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core substream: audio stream component that adheres to the original DTS Coherent Acoustics definition 

dependent substream: specific type of extension substream that is associated with a core substream 

DTS Core Audio Stream: carries the coding parameters of up to 5.1 channels of the original LPCM audio at up to 
24 bits per sample with the sampling frequency of up to 48 kHz 

DTS Extended Audio Stream: channel or frequency extension appended to the core audio component in the core 
substream 

DTS XCH Stream: one of DTS extended streams that carries the coding parameters obtained from encoding of up to 
1 additional channels of original LPCM audio at up to 24 bits per sample with the sampling frequency of up to 48 kHz 

DTS X96 Stream: DTS extended audio stream that enables encoding of original LPCM audio at up to 24 bits per 
sample with the sampling frequency of up to 96 kHz 

NOTE: The stream carries the coding parameters used for the representation of all remaining audio components 
that are present in the original LPCM audio and are not represented in the core audio stream. 

duration: time represented by one decoded audio frame, may be represented in audio samples per channel at a specific 
audio sampling frequency or in seconds 

extension: audio stream component providing a specific enhancement or coding profile 
extension substream: audio stream component that adheres to the definitions described in clause 7 
ExtractBits (w): pseudo-function that extracts next n consecutive bits from the stream 
False: Boolean logic value = 

LBR: DTS-HD extension used to implement the low bit rate coding profile 
Linear Pulse Code Modulated (LPCM): sequence of digital audio samples 
main audio: default audio presentation 

PES payload: portion of the PES packet following the PES header 
primary audio channels: audio channels encoded in the DTS core 
primary audio presentation: synonymous with main audio 

QMF bank: specific filtering structure that provides the means of translating the time domain signal into the multiple 
subband domain signals 

secondary audio: auxiliary or supplemental program 

SPDIF: generically referring to S/PDIF or TOSLINK serial audio interfaces 

substream: sequence of synchronized frames comprising one of the logical components of the audio stream 

true: Boolean logic value = 1 

uimsbf: unsigned integer most significant bit first 

vector quantization: joint quantization of a block of signal samples or a block of signal parameters 

X96: extension that contains the spectrum beyond 24 kHz to compliment a specific set of coded audio stored at a 
sampling frequency of 48 kHz, permitting source material originally sampled at 96 kHz to be played back on both 
4 kHz and 96 kHz capable systems 

XBR: extension containing resolution enhancements to the audio elements stored in the core substream 
XCH: extension that adds a centre surround channel 
XLL: lossless audio coding extension 
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XXCH: extension that can add up to 32 channels at arbitrary locations to a DTS core (which is limited to up to 

5.1 channels) in either the core substream, or in an extension substream 

3.2 Abbreviations 



For the purposes of the present document, the following abbreviations apply: 



ADPCM 


Adaptive Differential Pulse Code Modulation 


ASPF 


Audio Sync Word Insertion Flag 


CA 


Coherent Acoustics 


CBR 


Constant Bit Rate 


CRC 


Cycle Redundancy Check 


DAC 


Digital to Analogue Converter 


DNG 


Dialog Normalization Gain 


DRC 


Dynamic Range Control 


DSP 


Digital Signal Processor 


ES 


Extended Surround 


FIR 


Finite Impulse Response 


HCRC 


Header CRC Check Bytes 


HFLAG 


Predictor History Flag Switch 


LAR 


Linear Area Ratios (as defined in 8.4.3 prior to usage) 


LFE 


Low Frequency Effects 


LFF 


Low Frequency Effects Flag 


LPCM 


Linear Pulse Code Modulation 


LR 


Left Right 


LSB 


Least Significant Bit 


MDCT 


Modified Discrete Cosine Transform 


MIPS 


Million Instructions per Second 


MSB 


Most Significant Bit 


PCM 


Pulse Code Modulation 


PES 


Packetized Elementary Stream 


PID 


Package Identifier 


PMT 


Program Map Table 


QMF 


Quadrature Mirror Filter 


VBR 


Variable Bit Rate 


VQ 


Vector-Quantized 



3.3 Document Conventions 

A number of conventions are applied throughout the present document: 

• In parameter descriptions, most significant byte and most significant bit first, (big endian), convention is 
utilized unless otherwise specified. 

• The existence of many bit fields is determined by conditions of prior bits in the stream. As such, in many cases 
the bit stream elements are described using standard 'C conventions, with a pseudo function 
ExtractBits ( ) representing the bit field of interest for that description. 

• Bit field descriptions are described in presentation order. 

• In many cases, variable names are assigned to fields as they are being described. In some cases, the variable 
may be modified during definition, such as: 

nuFieldSize = ExtractBits (4 ) +1 
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4 Summary 
4.1 Overview 

The DTS Coherent Acoustics coding system is designed to deliver advanced features while supporting compatibility to 
a large installed base of decoders. The addition of the extension substream to the coding system permits a great deal of 
flexibility in defining new coding profiles and major feature enhancements. By adding a low bit rate profile and a 
lossless coding extension, the augmented system is able to address a broader range of applications. Metadata and new 
extensions increase versatility, such as embedded replacement channels, dynamic mixing coefficients and alternate 
channel maps. Instantiations of this new coding system exist in the market today and are known as DTS-HD™, 
including DTS-HD High Resolution Audio™ and DTS-HD Master Audio™. 

Due to the popularity of the 5.1 channel sound tracks in the movie industry and in the multichannel home audio market, 
DTS Coherent Acoustics is delivered in the form of a core audio (for the 5.1 channels) plus optional extended audio (for 
the rest of the DTS Coherent Acoustics). The 5.1 channel audio consists of up to five primary audio channels with 
frequencies lower than 24 kHz plus a possible Low Frequency Effect (LFE) channel (the 0.1 channel). This implies that 
the frequency components higher than 24 kHz for the five primary audio channels and all frequency components of the 
remaining two channels are carried in the extended audio. 

• Core Audio: 

Up to 5 primary audio channels (frequency components below 24 kHz). 

Up to 1 low frequency effect (LFE) channel. 

Optional information such as time stamps and user information. 

One channel extension ((XCH or XXCH) up to 7.1 channels) or a frequency extension (X96) for up to 
5.1 channel. 

• Extended Audio: 

Up to 32 additional full bandwidth channels (frequency components below 24 kHz). 

Frequency components above 24 kHz for the primary and extended audio channels. 

Resolution enhancement for primary channels in the core audio, including up to bit exact replication of 
the audio source material (lossless compression). 

Hierarchically embedded downmix capability. 

• Lossless coding algorithm: 

Capable of extending the original DTS core to render a bit-exact reproduction of the studio master in a 
bit-stream with inherent legacy compatibility. 

Capable of operating in a stand-alone mode for maximum efficiency when legacy compatibility is not 
required. 

With the core + extension structure, the core is always coded without a reference to the extension, allowing the core 
substream to be decoded independently. A sophisticated decoder, however, can first decode the 5.1 core audio bits and 
then proceed to decode the extension substream to enhance sonic accuracy (up to and including lossless), add channels 
and extend the audio sampling frequency. 

When bit efficiency is paramount and legacy compatibility is not an issue, the use of the extension substream with the 
LBR extension provides a flexible and robust solution. Some of the advantages of the low bit rate profile are: 

• Optimized for bit rate sensitive broadcasting and internet streaming. 

• 5.1 channels of broadcast quality audio in as low as 192 Kbits/s. 

• Up to 7. 1 channel capability. 
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• Same rich metadata feature set as is available for other DTS formats. 



4.2 Organization of the present document 

The present document will describe the fundamental elements of this coding system, including: 

• Clause 5: Core Audio - Bit stream syntax of the DTS core header and coding of DTS Coherent Acoustics 
(a.k.a. core, or core audio) and DTS core metadata extensions. 

• Clause 6: Core Extensions - Bit-stream syntax of DTS core extensions used to enhance channel count and 
sampling frequency. 

• Clause 7: Extension Substream - Bit stream syntax of the header and metadata components. 

• Clause 8: Lossless Coding Extension - Bit stream syntax and coding. 

• Clause 9: LBR Coding Extension - Bit stream syntax and coding. 

• In the annexes: 

Supplemental algorithms useful in decoding the DTS bitstream. 
Look-up tables. 

Information necessary to embed the DTS core and extension substream components into ISO media files 
(in accordance with ISO/IEC 14496-12 [i.l]). 

Guidance on embedding the DTS core and extension substream components into MPEG-2 broadcast 
streams, referring to TS 101 154 [2] and EN 300 468 [1] and in accordance with ISO/IEC 13818-1 [i.2]. 



5 Core Audio 

The DTS core encoder delivers 5.1 channel audio at 24 bits per sample with a sampling frequency of up to 48 kHz. As 
shown in Figure 5-1, the audio samples of a primary channel are split and decimated by a 32-band QMF bank into 
32 subbands. The samples of each subband goes through an adaptive prediction process to check if the resultant 
prediction gain is large enough to justify the overhead of transferring the coefficients of prediction filter. The prediction 
gain is obtained by comparing the variance of the prediction residual to that of the subband samples. If the prediction 
gain is big enough, the prediction residual is quantified using mid-tread scalar quantization and the prediction 
coefficients are Vector-Quantized (VQ). Otherwise, the subband samples themselves are quantized using mid-tread 
scalar quantization. In the case of low bit rate applications, the scalar quantization indexes of the residual or subband 
samples are further encoded using Huffman code. When the bit rate is low, Vector Quantization (VQ) may also be used 
to quantize samples of the high-frequency subbands for which the adaptive prediction is disabled. In very low bit rate 
applications, joint intensity coding and sum/difference coding may be employed to further improve audio quality. The 
optional LFE channel is compressed by: low-pass filtering, decimation and mid-tread scalar quantization. 
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NOTE: The dotted lines indicate optional operations and dash dot lines bit allocation control. 

Figure 5-1 : Compression of a primary audio channel 



DTS 

• Bit 
Stream 



5.1 Frame structure and decoding procedure 

DTS bit stream is a sequence of synchronized frames, each consisting of the following fields (see Figure 5-2). A pseudo 
code overview of the main function calls is listed in clause C.l. 

• Synchronization Word: Synchronize the decoder to the bit stream. 

• Frame Header: Carries information about frame construction, encoder configuration, audio data arrangement 
and various operational features. See clause 5.3 for details of the Frame Header. 

• Subframes: Carries core audio data for the 5.1 channels. Each frame may have up to 16 subframes. See clause 
5.4 for details of the primary audio coding header routines. 

• Optional Information: Carries auxiliary data such as time code, which is not intrinsic to the operation of the 
decoder but may be used for post processing routines. Some optional metadata has been defined in clause 5.7. 

• Extended Audio: Carries possible extended channels and frequency bands of the primary audio channels as 
well as all frequency components of channels beyond 5.1. These extensions are described in clause 6. 

Each subframe contains data for audio samples of the 5.1 channels covering a time duration of up to that of the subband 
analysis window and can be decoded entirely without reference to any other subframes. A subframe consists of the 
following fields (see Figure 5-3): 

• Side Information: Relays information about how to decode the 5.1 channel audio data. Information for joint 
intensity coding is also included here. 

• High Frequency VQ: A small number of high frequency subbands of the primary channels may be encoded 
using VQ. In this case, the samples of each of those subbands within the subframe are encoded as a single VQ 
address. 

• Low Frequency Effect Channel: The decimated samples of the LFE channel are carried as 8-bit words. 

• Subsubframes: All subbands, except those high-frequency VQ encoded ones, are encoded here in up to 
4 subsubframes. 
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□ □□ 




Figure 5-2: DTS frame structure 
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Figure 5-3: Subframe structure 



5.2 Synchronization 

DTS bit streams consists of a sequence of audio frames, usually of equal size and each frame begins with a 32-bit 
synchronization word. 

The first decoding step is to search the input bit stream for SYNC. In order to reduce the probability of false 
synchronization, 6 bits after SYNC in the bit stream may be further checked, (FTYPE and SHORT), since they usually 
do not change for normal frames (they do carry useful information about frame structure). These 6 bits should be 0x3f 
(the binary 1 1 1 1 1 1) for normal frames and are called synchronization word extension. Concatenating them with SYNC 
gives an extended synchronization word (32 + 6 = 38 bits). 



5.3 Frame header 

The frame header consists of a bit stream header and a primary audio coding header. The bit stream header provides 
information about the construction of the frame, the encoder configuration such as core source sampling frequency and 
various optional operational features such as embedded dynamic range control. The primary audio coding header 
specifies the packing arrangement and coding formats used at the encoder to assemble the audio coding side 
information. Many elements in the headers are repeated for each separate audio channel. 
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5.3.1 Bit stream header 



Table 5-1 : Bit-stream header 



Core Frame Header 


Size (bits) 


SYNC = ExtractBits (32) ; //0x7FFE8001 


32 


FTYPE = ExtractBits (1) ; 


1 


SHORT = ExtractBits (5) ; 


5 


CPF = ExtractBits (1) ; 


1 


NBLKS = ExtractBits (7) ; 


7 


FSIZE = ExtractBits (14) ; 


14 


AMODE = ExtractBits (6) ; 


6 


SFREQ = ExtractBits (4) ; 


4 


RATE = ExtractBits (5) ; 


5 


FixedBit = ExtractBit ( 1 ) ; 


1 


DYNF = ExtractBits (1) ; 


1 


TIMEF = ExtractBits (1) ; 


1 


AUXF = ExtractBits (1) ; 


1 


HDCD = ExtractBits (1) ; 


1 


EXT AUDIO ID = ExtractBits (3) ; 


3 


EXT_AUDIO = ExtractBits (1) ; 


1 


ASPF = ExtractBits (1) ; 


1 


LFF = ExtractBits (2) ; 


2 


HFLAG = ExtractBits (1) ; 


1 


if ( CPF == 1 ) // Present only if CPF=1. 


1 6 


HCRC = ExtractBits (16) ; 


FILTS = ExtractBits (1) ; 


1 


VERNUM = ExtractBits (4) ; 


4 


CHIST = ExtractBits ( 2 ) ; 


2 


fcmk = iixtractbits { 3 ) ; 


Q 
O 


SUMF = ExtractBits (1) ; 


1 


SUMS = ExtractBits (1) ; 


1 


switch (VERNUM) { 




case 6 : 




DIALNORM = ExtractBits (4) ; 




DNG = - (16+DIALNORM) ; 




break ; 




case7 : 




DIALNORM = ExtractBits (4) ; 


4 


DNG = -DIALNORM; 




break ; 




default : 




UNSPEC = ExtractBits (4) ; 




DNG = DIALNORM = ; 




break; 

} 





SYNC (Sync word) 

The sync word denotes the start of an audio frame, or a component thereof. For a core substream, the sync word is 
0x7ffe8001. A core frame can also exist in an extension substream, where the sync word is 0x02b9261. 

FTYPE (Frame type) 

This field indicates the type of current frame: 
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Table 5-2: Frame Type 



FTYPE 


Frame Type 


1 


Normal frame 





Termination frame 



Termination frames are used when it is necessary to accurately align the end of an audio sequence with a video frame 
end point. A termination block carries nx32 core audio samples where block length n is adjusted to just fall short of the 
video end point. Two termination frames may be transmitted sequentially to avoid transmitting one excessively small 
frame. 

SHORT (Deficit Sample Count) 

This field defines the number of core samples by which a termination frame falls SHORT of the normal length of a 
block. A block = 32 PCM core samples per channel, corresponding to the number of PCM core samples that are fed to 
the core filter bank to generate one subband sample for each subband. A normal frame consists of blocks of 32 PCM 
core samples, while a termination frame provides the flexibility of having a frame size precision finer than the 32 PCM 
core sample block. On completion of a termination frame, (SHORT+1) PCM core samples shall be padded to the output 
buffers of each channel. The padded samples may be zeros or they may be copies of adjacent samples. 



Table 5-3: Deficit Sample Count 



SHORT 


Valid Value or Range of SHORT 


1 


[0,30] 





31 (indicating a normal frame). 



CPF (CRC Present Flag) 

This field should always be set to 0. In the event that a bitstream was created prior to the occlusion of this bit from the 
bitstream definition, all decoders are required to test this flag. 

NBLKS (Number of PCM Sample Blocks) 

This field indicates that there are (NBLKS+1) blocks (a block = 32 PCM core samples per channel, corresponding to 
the number of PCM samples that are fed to the core filter bank to generate one subband sample for each subband) in the 
current frame (see note). The actual core encoding window size is 32 x (NBLKS+1) PCM samples per channel. Valid 
range for NBLKS: 5 to 127. Invalid range for NBLKS: to 4. For normal frames, this indicates a window size of either 
4 096, 2 048, 1 024, 512, or 256 samples per channel. For termination frames, NBLKS can take any value in its valid 
range. 

NOTE: When frequency extension stream (X96) is present, the PCM core samples represent the samples at the 
output of the decimator that precedes the core encoder. This k-times decimator translates the original 
PCM source samples with the sampling frequency of Fs_src= kxSFREQ to the core PCM samples 
(Fs_core=SFREQ) suitable for the encoding by the core encoder. The core encoder can handle sampling 
frequencies SFREQ < 48 kHz and consequently: 

k=2 for 48 kHz < Fsrc < 96 kHz; and 

k=4 for 96 kHz < Fsrc < 1 92 kHz 

FSIZE (Primary Frame Byte Size) 

(FSIZE+1) is the total byte size of the current frame including primary audio data as well as any extension audio data. 
Valid range for FSIZE: 95 to 16 383. Invalid range for FSIZE: to 94. 

AMODE (Audio Channel Arrangement) 

Audio channel arrangement that describes the number of audio channels (CHS) and the audio playback arrangement 
(see Table 5-4). Unspecified modes may be defined at a later date (user defined code) and the control data required to 
implement them, i.e. channel assignments, down mixing etc., can be uploaded from the player platform. 
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Table 5-4: Audio channel arrangement (AMODE) 



AMODE 


CHS 


Arranopmpnt 

1 1 \A 1 lUvl 1 1 v III 


ohonnnnn 


■ 


A 


nhnnnnm 


o 

C- 


r\ t D ^UUdl IIIUIIU^ 




O 

C- 


1 _l R fctprprA 
i— t n ^oitJicuy 


nhnnnm 1 


2 




ohooni nn 

V/UV/V/U 1 uu 


o 


1 T -uRT Hpft anH rinht tntah 
i— i tni ^itfiLdiiuiiyiiL lu Lai ) 


ohoooi m 

V/UV/V/«J I U I 


o 


o + i_ + n 


ohoooi 1 n 

\JU\J\J\J I I u 


o 


1 , R , C 

l_ T 1 IT O 


OhOOOl 1 1 


4 


\J T 1— T rlT O 


ohoni nnn 

\JU\J\J 1 uuu 


4 


1 j. R_t_ Q| j.<?R 

1 T 1 IT OL + OI 1 


0b001001 


5 


C + L + R+ SL+SR 


0b001010 


6 


CL + CR + L + R + SL + SR 


0b001011 


6 


C + L + R+ LR + RR + OV 


oboonoo 


6 


CF+ CR+LF+ RF+LR + RR 


0b001101 


7 


CL + C + CR + L+R + SL + SR 


oboomo 


8 


CL + CR + L + R + SL1 + SL2+ SR1 + SR2 


0b001 1 1 1 


8 


CL + C+ CR + L + R + SL + S+ SR 


0b010000 - 0b1 11111 




User defined 


NOTE: L = left, R = right, C = centre, S = surround, F = front, R = rear, T = total, OV = overhead, A = first 


mono channel, B = second mono channel. 



SFREQ (Core Audio Sampling Frequency) 

This field specifies the sampling frequency of audio samples in the core encoder, based on Table 5-5. When the source 
sampling frequency is beyond 48 kHz the audio is encoded in up to 3 separate frequency bands. The base-band audio, 
for example, kHz to 16 kHz, kHz to 22,05 kHz or kHz to 24 kHz, is encoded and packed into the core audio data 
arrays. The SFREQ corresponds to the sampling frequency of the base-band audio. The audio above the base-band (the 
extended bands), for example, 16 kHz to 32 kHz, 22,05 kHz to 44,1 kHz, 24 kHz to 48 kHz, is encoded and packed into 
the extended coding arrays which reside at the end of the core audio data arrays. If the decoder is unable to make use of 
the high sample rate data this information may be ignored and the base-band audio converted normally using a standard 
sampling rates (32 kHz, 44,1 kHz or 48 kHz). If the decoder is receiving data coded at sampling rates lower than that 
available from the system then interpolation (2x or 4x) will be required (see Table 5-6). 



Table 5-5: Core audio sampling frequencies 



SFREQ 


Source Sampling Frequency 


TimeStamp SampleRate 


ObOOOO 


Invalid 




0b0001 


8 kHz 


8 000 


0b0010 


16 kHz 


16 000 


0b0011 


32 kHz 


32 000 


0b0100 


Invalid 




0b0101 


Invalid 




0b01 10 


11,025 kHz 


11 025 


0b01 1 1 


22,05 kHz 


22 050 


0b1000 


44,1 kHz 


44 100 


0b1001 


Invalid 




0b1010 


Invalid 




0b1011 


12 kHz 


12 000 


0b1 100 


24 kHz 


24 000 


0b1 101 


48 kHz 


48 000 


0b1 1 10 


Invalid 




0b1 1 1 1 


Invalid 
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Table 5-6: Sub-sampled audio decoding for standard sampling rates 



Core Audio Sampling 


Hardware Sampling 


Required 


Frequency 


Frequency 


Filtering 


8 kHz 


32 kHz 


4 x Interpolation 


16 kHz 


32 kHz 


2 x Interpolation 


32 kHz 


32 kHz 


none 


11 kHz 


44,1 kHz 


4 x Interpolation 


22,05 kHz 


44,1 kHz 


2 x Interpolation 


44,1 kHz 


44,1 kHz 


none 


12 kHz 


48 kHz 


4 x Interpolation 


24 kHz 


48 kHz 


2 x Interpolation 


48 kHz 


48 kHz 


none 



RATE (Transmission Bit Rate) 

RATE specifies the targeted transmission data rate for the current frame of audio (see Table 5-7). The open mode 
allows for bit rates not defined by the table. 

Table 5-7: RATE parameter versus targeted bit-rate 



RATE 


Targeted Bit Rate 
[Kbits/s] 


ObOOOOO 


32 


0b00001 


56 


0b00010 


64 


obooon 


96 


0b00100 


112 


0b00101 


128 


0b00110 


192 


oboom 


224 


0b01000 


256 


0b01001 


320 


0b01010 


384 


0b01011 


448 


0b01100 


512 


0b01 101 


576 


obomo 


640 


0b01 1 1 1 


768 


OblOOOO 


960 


0b10001 


1 024 


0b10010 


1 152 


0b10011 


1 280 


0b10100 


1 344 


0b10101 


1 408 


0b101 10 


1 411,2 


0b1 01 1 1 


1 472 


0b11000 


1 536 


0b1 1 101 


open 


other 
codes 


invalid 



The targeted transmission rate may be greater than or equal to the actual bit rate. 
FixedBit (reserved field) 
This field is always set to 0. 
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DYNF (Embedded Dynamic Range Flag) 

DYNF indicates if embedded dynamic range coefficients are included at the start of each subframe. Dynamic range 
correction may be implemented on all channels using these coefficients for the duration of the subframe. 



Table 5-8: Embedded Dynamic Range Flag 



DYNF 


Dynamic Range Coefficients 





not present 


1 


present 



TIMEF (Embedded Time Stamp Flag) 

This field indicates if embedded time stamps are included at the end of the core audio data. 



Table 5-9: Embedded Time Stamp Flag 



TIMEF 


Time Stamps 





not present 


1 


present 



AUXF (Auxiliary Data Flag) 

This field indicates if auxiliary data bytes are appended at the end of the core audio data. 



Table 5-10: Auxiliary Data Flag 



AUXF 


Auxiliary Data Bytes 





not present 


1 


present 



HDCD 

The source material is mastered in HDCD format if HDCD = 1 and otherwise HDCD = 0. 
EXT_AUDIO_ID (Extension Audio Descriptor Flag) 

This flag has meaning only if the EXT_AUDIO = 1 (see Table 5-1 1) and then it indicates the type of data that has been 
placed in the extended audio. 



Table 5-11 : Extension Audio Descriptor Flag 



EXT AUDIO I 
D 


Type of Extension Data 





Channel Extension (XCH) 


1 


Reserved 


2 


Frequency Extension (X96) 


3 


Reserved 


4 


Reserved 


5 


Reserved 


6 


Channel Extension (XXCH) 


7 


Reserved 



EXT_AUDIO (Extended Coding Flag) 

Indicates if extended audio coding data are present after the core audio data. Extended audio data will include the data 
for the extended bands of the 5 normal primary channels as well as all bands of additional audio channels. To simplify 
the process of implementing a 5.1 channel 48 kHz decoder, the extended coding data arrays are placed at the end of the 
core audio array. 
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Table 5-12: Extended Coding Flag 



EXT_AUDIO 


Extended Audio Data 





not present 


1 


present 



ASPF (Audio Sync Word Insertion Flag) 

Indicates how often the audio data check word DSYNC (OxFFFF Extension Audio Descriptor Flag) occurs in the data 
stream. DSYNC is used as a simple means of detecting the presence of bit errors in the bit stream and is used as the 
final data verification stage prior to transmitting the reconstructed PCM words to the DACs. 



Table 5-13: Audio Sync Word Insertion Flag 



ASPF 


DSYNC Placed at End of Each 





Subframe 


1 


Subsubframe 



LFF (Low Frequency Effects Flag) 

Indicates if the LFE channel is present and the choice of the interpolation factor to reconstruct the LFE channel 
(see Table 5-14). 



Table 5-14: Flag for LFE channel 



LFF 


LFE Channel 


Interpolation Factor 





not present 




1 


Present 


128 


2 


Present 


64 


3 


Invalid 





HFLAG (Predictor History Flag Switch) 

If frames are to be used as possible entry points into the data stream or as audio sequence\start frames the ADPCM 
predictor history may not be contiguous. Hence these frames can be coded without the previous frame predictor history, 
making audio ramp-up faster on entry. When generating ADPCM predictions for current frame, the decoder will use 
reconstruction history of the previous frame if HFLAG = 1 . Otherwise, the history will be ignored. 

HCRC (Header CRC Check Bytes) 

If CPF = 1 then HCRC shall be extracted from the bitstream. The CRC value test shall not be applied. 
FILTS (Multirate Interpolator Switch) 

Indicates which set of 32-band interpolation FIR coefficients is to be used to reconstruct the subband audio (see 
Table 5-15). 



Table 5-15: Multirate interpolation filter bank switch 



FILTS 


32-band Interpolation Filter 





Non-perfect Reconstruction 


1 


Perfect Reconstruction 



VERNUM (Encoder Software Revision) 

VERNUM indicates of the revision status of the encoder software (see Table 5-16). In addition, it is used to indicate the 
presence of the dialog normalization parameters (see Table 5-20). 
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Table 5-16: Encoder software revision 



VERNUM 


Encoder Software Revision 


0to6 


Future revision (compatible with the present document) 


7 


Current 


8 to 15 


Future revision (incompatible with the present document) 


NOTE: If the decoder encounters the DTS stream with the VERNUM>7 and the 
decoder is not designed for that specific encoder software revision then it 
shall mute its outputs. 



CHIST (Copy History) 

This field indicates the copy history of the audio. Because of the copyright regulations, the exact definition of this field 
is deliberately omitted. 

PCMR (Source PCM Resolution) 

This field indicates the quantization resolution of source PCM samples (see Table 5-17). The left and right surrounding 
channels of the source material are mastered in DTS ES format if ES = 1 and otherwise if ES = 0. 



Table 5-17: Quantization resolution of source PCM samples 



PCMR 


Source PCM Resolution 


ES 


ObOOO 


1 6 bits 





0b001 


1 6 bits 


1 


0b010 


20 bits 





0b011 


20 bits 


1 


0b110 


24 bits 





0b101 


24 bits 


1 


Others 


Invalid 


invalid 



SUMF (Front Sum/Difference Flag) 

This parameter indicates if front left and right channels are sum-difference encoded prior to encoding (see Table 5-18). 
If set to zero no decoding post processing is required at the decoder. 

Table 5-18: Sum/difference decoding status of front left and right channels 



SUMF 


Front Sum/Difference Encoding 





L = L, R = R 


1 


L = L + R, R = L- R 



SUMS (Surrounds Sum/Difference Flag) 

This parameter indicates if left and right surround channels are sum-difference encoded prior to encoding (see 
Table 5-19). If set to zero no decoding post processing is required at the decoder. 

Table 5-19: Sum/difference decoding status of left and right surround channels 



SUMS 


Surround Sum/Difference Encoding 





Ls = Ls, Rs = Rs 


1 


Ls = Ls + Rs, Rs = Ls - Rs 



DIALNORM/UNSPEC (Dialog Normalization /Unspecified) 

For the values of VERNUM = 6 or 7 this 4-bit field is used to determine the dialog normalization parameter. For all 
other values of the VERNUM this field is a place holder that is not specified at this time. 

The Dialog Normalization Gain (DNG), in dB, is specified by the encoder operator and is used to directly scale the 
decoder outputs samples. In the DTS stream the information about the DNG value is transmitted by means of combined 
data in the VERNUM and DIALNORM fields (see Table 5-20). 
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For all other values of the VERNUM (i.e. 0, 1, 2, 3, 4, 5, 8, 9, . . . 15) the UNSPEC 4-bit field should be extracted but 
ignored by the decoder. In addition, for these VERNUM values, the Dialog Normalization Gain should be set to 
i.e. DNG=0 indicates No Dialog Normalization. 

Table 5-20: Dialog Normalization Parameter 



DIALNORM 


VERNUM 


DNG (dB) 





7 


L 


1 


7 


-1 


2 


7 


-2 


3 


7 


-3 


4 


7 


-4 


5 


7 


-5 


6 


7 


-6 


7 


7 


-7 


8 


7 


-8 


9 


7 


-9 


10 


7 


-10 


11 


7 


-1 1 


12 


7 


-12 


13 


7 


-13 


14 


7 


-14 


15 


7 


r -15 





6 


-16 


1 


6 


-17 


2 


6 


-18 


3 


6 


-19 


4 


6 


-20 


5 


6 


-21 


6 


6 


-22 


7 


6 


l - 23 


8 


6 


-24 


9 


6 


-25 


10 


6 


-26 


11 


6 


-27 


12 


6 


-28 


13 


6 


-29 


14 


6 


-30 


15 


6 


-31 



5.3.2 Primary Audio Coding Header 



Table 5-21 : Primary audio coding header 



Core audio coding header 


Size (bits) 


SUBFS = ExtractBits (4) ; 




nSUBFS = SUBFS + 1; 


4 


PCHS = ExtractBits (3) ; 




nPCHS = PCHS + 1; 


3 


for (ch=0; ch<nPCHS; ch++) { 




SUBS[ch] = ExtractBits (5) ; 


5 bits per 


nSUBS [ch] = SUBS [ch] + 2 ; 

} 

for (ch=0; ch<nPCHS; ch++) { 


channel 




VQSUB[ch] = ExtractBits (5) ; 


5 bits per 


nVQSUB[ch] = VQSUB [ch] + 1 ; 

} 

for (ch=0; ch<nPCHS; ch++) { 


channel 


3 bits per 
channel 


JOINX[ch] = ExtractBits (3) ; 

} 


for (ch=0; ch<nPCHS; ch++) { 


2 bits per 
channel 


THUFF [ch] = ExtractBits (2) ; 

} 

for (ch=0; ch<nPCHS; ch++) { 


3 bits per 
channel 


SHUFF[ch] = ExtractBits (3) ; 

} 
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Core audio coding header 



Size (bits) 



for (ch=0; ch<nPCHS; ch++) { 
BHUFF [ch] = ExtractBits (3) ; 

} 

// ABITS=1: 
n=0; 

for (ch=0; ch<nPCHS; ch++) 

SEL[ch][n] = ExtractBits (1) ; 

// ABITS = 2 to 5: 

for (n=l; n<5; n++) 

for <ch=0; ch<nPCHS; ch++) 
SEL[ch][n] = ExtractBits (2) ; 

// ABITS = 6 to 10: 

for (n=5; n<10; n++) 

for (ch=0; ch<nPCHS; ch++) 

SEL[ch] [n] = ExtractBits (3) ; 

// ABITS = 11 to 2G: 

for (n=10; n<26; n++) 

for (ch=0; ch<nPCHS; ch++) 



3 bits per 
channel 



variable bits 



SEL [ch] [n] 



// Not transmitted, set to zero. 



n = 0; // ABITS = 1 

for (ch=0; ch<nPCHS; ch++) 
if ( SEL [ch] [n] == ) { 
used) 

// Extract ADJ index 
ADJ = ExtractBits (2) ; 
// Look up ADJ table 
arADJ[ch] [n] = Adj Table [ADJ] ; 



// Transmitted only if SEL=0 (Huffman code 



} 



// ABITS = 2 to 5: 



for (n=l; n<5; n++){ 

for <ch=0; ch<nPCHS; ch++) { 

if ( SEL[ch] [n] < 3 ) { // Transmitted only when SEL<3 
// Extract ADJ index 
ADJ = ExtractBits (2) ; 
// Look up ADJ table 
arADJ[ch] [n] = Adj Table [ADJ] ; 

} 



for (n=5; n<10; n++) { // ABITS = 6 to 10: 

for <ch=0; ch<nPCHS; ch++) { 

if ( SEL [ch] [n] < 7 ) { // Transmitted only when SEL<7 
// Extract ADJ index 

ADJ = ExtractBits (2 ) ; Look up ADJ table 



2 bits per 
occasion 



arADJ[ch] [n] = Adj Table [ADJ] ; // 



if ( CPF= 
AHCRC = 



=1 ) // Present only if CPF=1 
ExtractBits (1G) ; 



16 



SUBFS (Number of Subframes) 

This field indicates that there are nSUBFS = SUBFS+1 audio subframes in the core audio frame. SUBFS is valid for all 
audio channels. 

PCHS (Number of Primary Audio Channels) 

This field indicates that there are nPCHS = PCHS+1 < 5 primary audio channels in the current frame. If AMODE flag 
indicates more than five channels apart from LFE, the additional channels are the extended channels and are packed 
separately in the extended data arrays. 

SUBS (Subband Activity Count) 

This field indicates that there are nSUBS[ch] = SUBS[ch]+2 active subbands in the audio channel ch. Samples in 
subbands above nSUBS[ch] are zero, provided that intensity coding in that subband is disabled. 
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VQSUB (High Frequency VQ Start Subband) 

This field indicates that high frequency samples starting from subband nVQSUB[ch]=VQSUB[ch]+l are VQ encoded. 
High frequency VQ is used only for high frequency subbands, but it may go down to low frequency subbands for such 
audio episodes as silence. In case of insufficient MIPS, the VQs for the highest frequency subbands may be ignored 
without causing audible distortion. 

JOINX (Joint Intensity Coding Index) 

JOINX[ch] indicates if joint intensity coding is enabled for channel ch and which audio channel is the source channel 
from which channel ch will copy subband samples (Table 5-22). It is assumed that the source channel index is smaller 
than that of the intensity channel. 



Table 5-22: Joint Subband Coding Status and Source Channels 



JOINX[ch] 


Joint Intensity 


Source Channel 





Disabled 




>0 


Enabled 


JOINX[ch] 



THUFF (Transient Mode Code Book) 

This field indicates which Huffman codebook was used to encode the transient mode data (see Table 5-23). 

Table 5-23: Selection of Huffman Codebook for Encoding TMODE 



THUFF[ch] 


Huffman Codebook 





A4 


1 


B4 


2 


C4 


3 


D4 



SHUFF (Scale Factor Code Book) 

The scale factors of a channel are quantized nonlinearly using either a 6-bit (64-level, 2,2 dB per step) or a 7-bit 
(128-level, 1,1 dB per step) square root table, depending on the application. The quantization indexes may be further 
compressed by one of the five Huffman codes and this information is transmitted to the decoder by SHUFF[ch] (see 
Table 5-24). 

Table 5-24: Code Books and Square Root Tables for Scale Factors 



SHUFF[ch] 


Code Book 


Square Root Table 





SA129 


6 bit (clause D.1.1) 


1 


SB129 


6 bit (clause D.1.1) 


2 


SC129 


6 bit (clause D.1.1) 


3 


SD129 


6 bit (clause D.1.1) 


4 


SE129 


6 bit (clause D.1.1) 


5 


6-bit linear 


6 bit (clause D.1.1) 


6 


L 7-bit linear 


7 bit (clause D.1.2) 


7 


Invalid 


Invalid 
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BHUFF (Bit Allocation Quantizer Select) 

Indicates the codebook that was used to encode the bit allocation index ABITS (to be transmitted later) (see 
Table 5-25). 

Table 5-25: Codebooks for Encoding Bit Allocation Index ABITS 



BHUFF[ch] 


Codebook (clause D.5.6) 





A12 


1 


B12 


2 


C12 


3 


D12 


4 


E12 


5 


Linear 4-bit 


6 


Linear 5-bit 


7 


Invalid 



SEL (Quantization Index Codebook Select) 

After subband samples are quantized using a mid-tread linear quantizer, the quantization indexes may be further 
encoded using either entropy (Huffman) or block coding in order to reduce bit rate. Therefore, the subband samples 
may appear in the bitstream as plain quantization indexes (no further encoding), entropy (Huffman) codes, or block 
codes. For channel ch, the selection of a particular codebook for a mid-tread linear quantizer indexed by ABITS [ch] is 
transmitted to the decoder as SEL[ch] [ABITS [ch]]. No SEL is transmitted for ABITS[ch]>l 1, because no further 
encoding is used for those quantizers. The decoder can find out the particular codebook that was used using ABITS [ch] 
and SEL[ch] [ABITS [ch]] to look up Table 5-26. 



Table 5-26: Selection of Quantization Levels and Codebooks 



Quantizer Index 
(ABITS) 


Number of Index 
Quantization Levels 


Codebook Select (SEL) 


Jlj2!3l4j5_L6l7 








Not transmitted 


1 


3 


A3 


V3 














2 


5 


A5 


B5 


C5 


V5 










3 


7 


A7 


B7 


C7 


V7 










4 


9 


A9 


B9 


C9 


V9 










5 


13 


A13 


B13 


C13 


V13 










6 


17 


A17 


B17 


C17 


D17 


E17 


F17 


G17 


V17 


7 


25 


A25 


B25 


C25 


D25 


E25 


F25 


G25 


V25 


8 


33 or 32 


A33 


B33 


C33 


D33 


E33 


F33 


G33 


NFE 


9 


65 or 64 


A65 


B65 


C65 


D65 


E65 


F65 


G65 


NFE 


10 


129 or 128 


A129 


B129 


C129 


D129 


E129 


F129 


G129 


NFE 


11 


256 


NFE 
















12 


512 


NFE 
















13 


1 024 


NFE 
















14 


2 048 


NFE 
















15 


4 096 


NFE 
















16 


8 192 


NFE 
















17 


16 384 


NFE 
















18 


32 768 


NFE 
















19 


65 536 


NFE 
















20 


131 072 


NFE 
















21 


262 144 


NFE 
















22 


524 288 


NFE 
















23 


1 048 576 


NFE 
















24 


2 097 152 


NFE 
















25 


4 194 304 


NFE 
















26 


8 388 608 


NFE 
















27-32 


Invalid 


Invalid 

















NOTE: NFE = No further encoding is used to encode the linearly quantized subband samples. A,B,C,D,E,F,G = 

Subband samples are encoded by Huffman code. V = 4 subband samples are grouped and encoded using 4- 
element block code. 
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ADJ (Scale Factor Adjustment Index) 



A scale factor adjustment index is transmitted whenever a SEL value indicates a Huffman codebook. This index points 
to the adjustment values shown in Table 5-27. This adjustment value should be multiplied by the scale factor (SCALE). 

Table 5-27: Scale Factor Adjustment Index 



Scale Factor Adjustment Index (ADJ) 


Adjustment Value 





1,0000 


1 


1,1250 


2 


1,2500 


3 


1 ,4375 


NOTE: This table shows the scale factor adjustment index values if Huffman 
coding is used to encode the subband quantization indexes. 



AHCRC (Audio Header CRC Check Word) 

If CPF = 1 then AHCRC shall be extracted from the bitstream. The CRC value test shall not be applied. 

5.4 Unpack Subframes 

5.4.1 Primary Audio Coding Side Information 

Table 5-28: Core side information 



Primary audio side information 



Size (bits) 



SSC = ExtractBits (2) ; 
nSSC = SSC+1; 
PSC = ExtractBits (3) ; 
for (ch=0; ch<nPCHS; ch++) 
for (n=0; n<nSUBS [ch] ; n++) 

PMODE[ch][n] = ExtractBits ( 1 ) ; 
for <ch=0; ch<nPCHS; ch++) 
for (n=0; n<nSUBS [ch] ; n++) 

if ( PMODE[ch] [n] >0 ) { // Transmitted only when ADPCM active 

// Extract the VQindex 

nVQIndex = ExtractBits { 12 ) ; 

// Look up the VQ table for prediction coefficients. 
ADPCMCoef fVQ. Lookup (nVQIndex, PVQ [ch] [n] ) // 4 coefficients 

} 

for (ch=0; ch<nPCHS; ch++) { 

// BHUFF tells which codebook was used 
nQSelect = BHUFF [ch] ; 

// Use this codebook to decode the bit stream for ABITS [ch] [n] 
for (n=0; n<nVQSUB [ch] ; n++) // Not for VQ encoded subbands . 
QABITS. ppQ [nQSelect] - >InverseQ ( InputFrame , ABITS [ch] [n] ) 

} 

for (ch=0; ch<nPCHS; ch++){ 
for (n=0; n<NumSubband; n++) 
TMODE [ch] [n] = ; 
// Decode TMODE [ch] [n] 

if { nSSC>l ) {// Transient possible only if more than one subsubframe. 

for (ch=0; ch<nPCHS; ch++) { 
// TMODE [ch] [n] is encoded by a codebook indexed by THUFF [ch] 

nQSelect = THUFF [ch]; 

for (n=0; n<nVQSUB [ch] ; n++) // No VQ encoded subbands 
if { ABITS [ch] [n] >0 ) // Present only if bits allocated 
// Use codebook nQSelect to decode TMODE from the bit stream 
QTMODE .ppQ [nQSelect] - >InverseQ ( InputFrame , TMODE [ch] [n] ) 

} 

} 

} 

for (ch=0; ch<nPCHS; ch++) { 
// Clear SCALES 
for (n=0; n<NumSubband; n++) { 



2 
3 

1 bit per 
subband 



1 2 bits per 
occurrence 



variable bits 



variable bits 



variable bits 
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Primary audio side information 



Size (bits) 



SCALES [ch] [n] [0] = ; 
SCALES [ch] [n] [1] = ; 

} 

nQSelect = SHUFF [ch] ; // SHUFF indicates which codebook was used to encode SCALES 
// Select the root square table (SCALES were nonlinearly quantized) . 
if ( nQSelect == 6 ) 

pScaleTable = &RMS7Bit ; // 7-bit root square table 
else 

pScaleTable = &RMS6Bit ; // 6-bit root square table 

// Clear accumulation (if Huffman code was used, the difference of SCALES was 
encoded) . 

nScaleSum = ; 

// Extract SCALES for Subbands up to VQSUB[ch] 
for (n=0; n<nVQSUB [ch] ; n++) 

if ( ABITS [ch] [n] >0 ) { // Not present if no bit allocated 
// First scale factor 

// Use the (Huffman) code indicated by nQSelect to decode the quantization 

// index of SCALES from the bit stream 

QSCALES .ppQ [nQSelect] - >InverseQ ( InputFrame , nScale) ; 

// Take care of difference encoding 

if ( nQSelect < 5 ) // Huffman encoded, nScale is the difference 

nScaleSum += nScale; // of the quantization indexes of SCALES, 

else // Otherwise, nScale is the quantization 

nScaleSum = nScale; // level of SCALES. 
// Look up SCALES from the root square table 
pScaleTable ->LookUp (nScaleSum, SCALES [ch] [n] [0]) 
// Two scale factors transmitted if there is a transient 
if (TMODE [ch] [n] >0) { 

// Use the (Huffman) code indicated by nQSelect to decode the 

// quantization index of SCALES from the bit stream 
QSCALES .ppQ [nQSelect] - >InverseQ ( InputFrame , nScale) ; 
// Take care of difference encoding 

if ( nQSelect < 5 ) // Huffman encoded, nScale is the 

nScaleSum += nScale; // of SCALES, 

else // Otherwise, nScale is SCALES 

nScaleSum = nScale; // itself. 
// Look up SCALES from the root square table 
pScaleTable ->LookUp (nScaleSum, SCALES [ch] [n] [1]) 

} 

} 

// 

// High frequency VQ subbands 

// 

for (n=nVQSUB [ch] ; n<nSUBS [ch] ; n++) { 

// Use the code book indicated by nQSelect to decode 
// the quantization index of SCALES from the bit stream 
QSCALES. ppQ [nQSelect] ->InverseQ( InputFrame, nScale) ; 
// Take care of difference encoding 

if ( nQSelect < 5 ) // Huffman encoded, nScale is the 

nScaleSum += nScale; // of SCALES, 
else // Otherwise, nScale is SCALES 

nScaleSum = nScale; // itself. 
// Look up SCALES from the root square table 
pScaleTable- >LookUp (nScaleSum, SCALES [ch] [n] [0]) 

} 

} 

for (ch=0; ch<nPCHS; ch++) 

if (JOINX[ch]>0 ) // Transmitted only if joint subband coding enabled. 

J0INJ3HUFF [ch] = ExtractBits ( 3 ) ; 
for (ch=0; ch<nPCHS; ch++) 

if (JOINX[ch]>0 ) { // Only if joint subband coding enabled. 

nSourceCh = JOINX [ch] -1 ; // Get source channel. JOINX counts 
// channels as 1,2,3,4,5, so minus 1. 
nQSelect = JOINJ3HUFF [ch] ; // Select code book, 
for (n=nSUBS [ch] ; n<nSUBS [nSourceCh] ; n++) { 

// Use the code book indicated by nQSelect to decode 

// the quantization index of JOIN_SCALES 

QSCALES. ppQ [nQSelect] ->InverseQ( InputFrame, nJScale) ; 

// Bias by 64 

nJScale = nJScale + 64; 

// Look up JOIN_SCALES from the joint scale table; 
JScaleTbl .Lookup (nJScale, JOIN_SCALES [ch] [n] ) ; 

} 



3 bits per 
channel 



variable bits 
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Primary audio side information 


Size (bits) 


if { DYNF != ) { 




nlndex = ExtractBits ( 8 ) ; 




RANGEtbl. Lookup (nlndex, RANGE) ; 




// The following range adjustment is to be performed 


8 


// after QMF reconstruction 

for (ch=0; ch<nPCHS; ch++) 

for (n=0; n<nNumSamples ; n++) 

AudioCh [ch] . ReconstructedSamples [n] *= RANGE ; 

} 




if ( CPF==1 ) // Present only if CPF=1. 
SICRC = ExtractBits (16) ; 


16 



SSC (Subsubframe Count) 

Indicates that there are nSSC = SSC+1 subsubframes in the current audio subframe. 
PSC (Partial Subsubframe Sample Count) 

PSC indicates the number of subband samples held in a partial subsubframe for each of the active subbands. A partial 
subsubframe is one which has less than 8 subband samples. It exists only in a termination frame and is always at the end 
of last normal subsubframe. A DSYNC word will always occur after a partial subsubframe. 

PMODE (Prediction Mode) 

PMODE[ch][n]=l indicates that ADPCM prediction is used (active) for subband n of primary audio channel [ch] and 
PMODE[ch][n]=0 otherwise. ADPCM shall be extracted from the bit stream for all subbands, but ADPCM 
reconstruction can be limited to the lowest 20 subbands if DSP does not have enough MIPS. 

PVQ (Prediction Coefficients VQ Address) 

Indexes to the vector code book (clause D.10.1) to obtain ADPCM prediction coefficients. PVQ is transmitted only for 
subbands where ADPCM is active. 

ABITS (Bit Allocation Index) 

ABITS[ch][n] is the index to the mid-tread linear quantizer that was used to quantize the subband samples for the n th 
subband of channel ch. ABITS [ch][n] may be transmitted as either a 4-bit or 5-bit word. In the case of a 4-bit word, it 
may be further encoded using one of the 5 Huffman codes. This encoding is the same for all subbands of each channel 
and is conveyed by BHUFF as shown in Table 5-25. There is obviously no need to allocate bits for the high frequency 
subbands because they are encoded using VQ. 

TMODE (Transition Mode) 

TMODE[ch][n] indicates if there is a transient inside a subframe (subband analysis window) for subband n of channel 
ch. If there is a transient (TMODE[ch][n]>0), it further indicates that the transition occurred in subsubframe (subband 
analysis sub window) TMODE [ch][n] + 1. TMODE [ch][n] is encoded by one of the 4 Huffman codes and the selection 
of which is conveyed by THUFF (see Table 5-23). The decoder assumes that there is no transition (TMODE [ch][n]=0) 
for all subbands of all channels unless it is told otherwise by the bit stream. Transient does not occur in the following 
situations, so TMODE is not transmitted when: 

• Only one subsubframe within the current subframe. This is because the time resolution of transient analysis is 
a subsubframe (subband analysis subwindow). 

• VQ encoded high frequency subbands. If there is a transient for a subband, it would not have been VQ 
encoded. 

• Subbands without bit allocation. If no bits are allocated for a subband, there is no need for transient. 
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SCALES (Scale Factors) 

One scale factor is transmitted for subbands without transient. Otherwise two are transmitted, one for the episode before 
the transient and the other for after the transient. The quantization indexes of the scale factors may be encoded by 
Huffman code as shown in Table 5-24. If this is the case, they are difference-encoded before Huffman coding. The scale 
factors are finally obtained by using the quantization indexes to look up either the 6-bit or 7-bit square root quantization 
table according to Table 5-24. 

JOIN SHUFF (Joint Subband Codebook Select) 

If joint subband coding is enabled (JOINX[ch]>0), JOIN SHUFF[ch] selects which code book was used to encode the 
scale factors (JOIN SCALES) which will be used when copying subband samples from the source channel to the 
current channel ch. These scale factors are encoded in exactly the same way as that for SCALES, using Table 5-24 to 
look up the codebook. 

JOIN SCALES (Scale Factors for Joint Subband Coding) 

The scale factors are used to scale the subband samples copied from the source channel (JOINX[ch]-l) to the current 
channel. The index of the scale factor is encoded using the code book indexed by JOIN SHUFF[ch]. After this index is 
decoded, it is used to look up the table in clause D.4, to get the scale factor. No transient is permitted for jointly encoded 
subbands, so a single scale factor is included. The joint subbands start from the nSUBS of the current channel until the 
nSUBS of the source channel. 

RANGE (Dynamic Range Coefficient) 

Dynamic range coefficient is to allow for the convenient compression of the audio dynamic range at the output of the 
decoder. Dynamic range compression is particularly important in listening environments where high ambient noise 
levels make it impossible to discriminate low level signals without risking damaging the loudspeakers during loud 
passages. This problem is further compounded by the growing use of 20-bit PCM audio recordings which exhibit 
dynamic ranges as high as 1 10 dB. 

Each coefficient is 8-bit signed fractional Q2 binary and represents a logarithmic gain value as shown in clause D.4 
giving a range of +31,75 dB in steps of 0,25 dB. Dynamic range compression is affected by multiplying the decoded 
audio samples by the linear coefficient. 

The degree of compression can be altered with the appropriate adjustment to the coefficient values at the decoder and 
can be switched off completely by ignoring the coefficients. 

SICRC (Side Information CRC Check Word) 

If CPF = 1 then SICRC shall be extracted from the bitstream. The CRC value test shall not be applied. 



5.5 Primary Audio Data Arrays 



Table 5-29: Core audio data arrays 



Primary audio data 



Size (bits) 



for (ch=0; ch<nPCHS; ch++) 

for (n=nVQSUB [ch] ; n<nSUBS [ch] ; n++) { 

// Extract the VQ address from the bit stream 
nVQIndex = ExtractBits ( 10 ) ; 

// Look up the VQ code book for 32 subband samples. 
HFreqVQ. Lookup (nVQIndex, HFREQ [ch] [n] ) 
// Scale and take the samples 

Scale = (real) SCALES [ch] [n] [0]; // Get the scale factor 
for (m=0; m<nSSC*8; m++, nSample++) 

aPrmCh[ch] .aSubband[n] . raSample [m] = rScale*HFREQ [ch] [n] [m] ; 

} 

if { LFF>0 ) { // Present only if flagged by LFF 
// extract LFE samples from the bit stream 
for (n=0; n<2*LFF*nSSC; n++) 

LFE [n] = (signed int) (signed char ) ExtractBits ( 8 ) ; 

// Use char to get sign extension because it is 8-bit 2's compliment. 
// Extract scale factor index from the bit stream 
LFEscalelndex = ExtractBits ( 8 ) ; 

// Look up the 7-bit root square quantization table 
pLFE_RMS->LookUp (LFEscalelndex, nScale) ; 



10 bits per 
subband 



8 bits per 
sample 
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Primary audio data 


Size (bits) 


// Account for the quantizer step size which is 0.035 

rScale = nScale*0 . 035 ; 

/ / Get the actual LFE samples 

for (n=0; n<2*LFF*nSSC; n++) 

LFECh.rLFE [k] = LFE [n] *rScale ; 
// Interpolation LFE samples 

LFECh. InterpolationFIR (LFF) ; // LFF indicates which interpolation filter to use 

} 




Audio Data 


// Select quantization step size table 
if { RATE == Oxlf ) 

pStepSizeTable = &StepSizeLossLess ; // Lossless quantization 
else 

pStepSizeTable = &StepSizeLossy ; // Lossy 
// Unpack the subband samples 

for (nSubSubFrame=0 ; nSubSubFrame<nSSC ; nSubSubFrame++ ) { 
for (ch=0; ch<nPCHS; ch++) 

for (n=0; n<nVQSUB [ch] ; n++) { // Not high frequency VQ subbands 
// Select the mid-tread linear quantizer 

nABITS = ABITS [ch] [n] ; // Select the mid-tread quantizer 
pCQGroup = &pCQGroupAUDIO [nABITS- 1] ; // Select the group of 
/ / code books corresponding to the 
// the mid-tread linear quantizer. 

nNumQ = pCQGroupAUDIO [nABITS-1] .nNumQ-1 ; // Number of code 
// books in this group 

// Determine quantization index code book and its type 
// Select quantization index code book 
nSEL = SEL[ch] [nABITS-1]; 
// Determine its type 

nQType =1; // Assume Huffman type by default 
if ( nSEL==nNumQ ) { // Not Huffman type 

if { nABITS<=7 ) 

nQType =3; // Block code 

else 

nQType =2; // No further encoding 

} 

if ( nABITS==0 ) // No bits allocated 

nQType = ; 
// Extract bits from the bit stream 
switch ( nQType ) { 
case 0: //No bits allocated 

for (m=0; m<8; m++) 

AUDIO [m] = 0; 

break ; 

case 1: // Huffman code 
for (m=0; m<8; m++) 

pCQGroup- >ppQ [nSEL] - >InverseQ ( InputFrame , AUDIO [m] ) ; 
break ; 

case 2: // No further encoding 
for (m=0; m<8; m++) { 

// Extract quantization index from the bit stream 
pCQGroup- >ppQ [nSEL] - >InverseQ ( InputFrame , nCode) 
// Take care of 2's compliment 

AUDIO [m] = pCQGroup- >ppQ [nSEL] ->SignExtension (nCode) ; 

} 

break ; 
case 3 : // Block code 

pCBQ = kpCBlockQ [nABITS-1] ; // Select block code book 
m = ; 

for (nBlock=0; nBlock<2; nBlock++) { 

// Extract the block code index from the bit stream 
pCQGroup- >ppQ [nSEL] - >InverseQ ( InputFrame , nCode) 
// Look up 4 samples from the block code book 
pCBQ->LookUp (nCode, SAUDIO [m] ) 
m + = 4 ; 
} 

break; 
default: // Undefined 

printf (" ERROR : Unknown AUDIO quantization index code book."); 

} 

} 

// Account for quantization step size and scale factor 
// Look up quantization step size 
nABITS = ABITS [ch] [n] ; 

pStepSizeTable- >LookUp (nABITS, rStepSize) ; 
// Identify transient location 
nTmode = TMODE [ch] [n] ; 


variable bits 
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Primary audio data 


Size (bits) 


if { nTmode == ) // No transient 

nTmode = nSSC; 
// Determine proper scale factor 
if (nSubSubFrame<nTmode) // Pre-transient 

rScale = rStepSize * SCALES [ch] [n] [0]; // Use first scale factor 
else // After-transient 

rScale = rStepSize * SCALES [ch] [n] [1] ; // Use second scale factor 
// Adjustmemt of scale factor 

rScale *= arADJ[ch] [SEL [ch] [nABITS-1] ] ; // arADJ[] [] are assumed 1 

// unless changed by bit 

// stream when SEL indicates 

// Huffman code. 

// Scale the samples 

nSample = 8*nSubSubFrame ; // Set sample index 
for (m=0; m<8; m++, nSample++) 

aPrmCh [ch] . aSubband [n] . aSample [nSample] = rScale*AUDIO [m] ; 
// Inverse AD PCM 

if ( PMODE [ch] [n] != ) // Only when prediction mode is on. 

aPrmCh[ch] . aSubband [n] . InverseADPCM ( ) ; 
// Check for DSYNC 

if { <nSubSubFrame== (nSSC-1) ) || <ASPF==1) ) { 
DSYNC = ExtractBits (1G) ; 
if ( DSYNC != Oxffff ) 

printf ( "DSYNC error at end of subsubframe #%d" , nSubSubFrame) ; 

} 

} 





HFREQ (VQ Encoded Subbands) 

At low bit rates, some high frequency subbands are encoded using Vector Quantization (VQ). Each vector from this 
code book consists of 32 subband samples, corresponding to the maximum possible subframe (4 normal subsubframes): 

4 subsubframe x 8 samples/subsubframe = 32 samples: 

If the current subframe is short of 32 samples, the remaining samples are padded with either zeros or "don't care" and 
then vector-quantized. The vector address is then included in the bit stream. After the decoder picks up the vector 
address, it looks up the vector code book to get the 32 samples. But the decoder will only pick nSSC X 8 out of the 
32 samples and scale them with the scale factor SCALES. 

LFE (Low Frequency Effect Data) 

The presence of a LFE channel and its interpolation filter selection are flagged by LFF in the frame header (see 
Table 5-14). The number of decimated LFE samples in the current subframe is 2 x LFF x nSSC, corresponding to the 
decimation factor and the subframe size. The LFE samples are normalized with a scale factor and then quantized with a 
step size of 0,035, before being included in the bit stream as 8-bit 2's compliment. This scale factor is nonlinearly 
quantized using the 7-bit root square and then directly included in the bit stream right after the decimated LFE samples. 
Therefore, on the decoder side, these decimated LFE samples need to be adjusted by the quantization step size and scale 
factor. After this adjustment, they are used to interpolate the other samples. The choice of the interpolation filter is 
indicated by LFF as shown in Table 5-14. 

AUDIO (Audio data) 

The audio data are grouped as nSSC subsubframes, each consisting of 8 samples for each subband. Each sample was 
quantized by a mid-tread linear quantizer indexed by ABITS. The resultant quantization index may further be encoded 
by either a Huffman or block code. If it is not, it is included in the bit stream as 2's compliment. All this information is 
indicated by SEL. The (ABITS, SEL) pair then tells how the subband samples should be extracted from the bit stream 
(Table 5-26). 

The resultant subband samples are then compensated by their respective quantization step sizes and scale factors. 
Special care shall be paid to possible transient in the subframe. If a transient is flagged by TMODE, one scale factor 
should be used for samples before the transient and the other one for the after the transient. 

For some of the subbands that are ADPCM encoded, the samples of these subbands thus far obtained are actually the 
difference signals. Their real values shall be recovered through a reverse ADPCM process. 

At end of each subsubframe there may be a synchronization check word DSYNC = Oxffff depending on the flag ASPF 
in the frame header, but there shall be at least a DSYNC at the end of each subframe. 
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5.6 Unpack Optional Information 

The optional information may be included at the end of the frame following completion of the audio data arrays, 
depending on the status of the optional header flags. This data is not intrinsic to the operation of the decoder but may be 
used for post processing routines. 



Table 5-30: Core optional information 



Optional information 


Size (bits) 


if { TIMEF==1 ) // Present only when TIMEF=1. 


32 


TIMES = ExtractBits (32) ; 


if { AUXF==1 ) // Present only if AUXF=1. 




AUXCT = ExtractBits (6) ; 


6 


else 




AUXCT =0; // Clear it. 




ByteAlign = ExtractBits (0 ... 7) 


0to7 


for (int n=0; n<AUXCT; n++) 


8xAUXCT 


AUXD[n] = ExtractBits (8) ; 


bits 


if ( (CPF==1) && (DYNF ! =0 ) ) ) 


16 


OCRC = ExtractBits (IS) ; 



TIMES (Time Code Stamp) 

Time code may be used to align audio to video. 
AUXCT (Auxiliary Data Byte Count) 

The number of auxiliary data bytes to be transmitted in the following AUXD array. It shall be in the range of 1 to 63. 
ZeroPadAux (Pad for nSYNCAUX) 

The beginning of auxiliary data bytes is aligned on the 32-bit boundary from the beginning of the core stream (DWORD 
aligned). This is achieved by inserting a necessary number of zero bits after the AUXCT data. 

AUXD (Auxiliary Data Bytes) 

Unpacking of the auxiliary data is detailed in clause 5.7.1. 
OCRC (Optional CRC Check Bytes) 

If CPF = 1 then OCRC shall be extracted from the bitstream. The CRC value test shall not be applied. 

Additional optional metadata chunks may follow in an arbitrary order. This version of the specification defines one 
additional metadata chunk denoted by Rev2 Auxiliary Data Chunk. The structure of this chunk is described in 
clause 5.7.2. The existence of Rev2 Auxiliary Data Chunk is not dependent on the value of the AUXF flag, i.e. the Rev2 
Auxiliary Data Chunk may be encoded in the stream even when AUXF=FALSE. 

5.7 Optional Information 

This clause describes metadata blocks that may optionally be included in the bitstream. 



5.7.1 Auxiliary Data 

Table 5-31 : Core AUX data 



Auxiliary data 


Size (bits) 


/ / Advance to DWORD boundary 

InputFrame . Advance2NextDWord ( ) ; 

// Extract AUX Sync Word 

nSYNCAUX = InputFrame . ExtractBits { 32 ) ; 


32 
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Auxiliary data 



Size (bits) 



false ; 



// Extract AUX decode time stamp flag 
bAUXTimeStampFlag = (ExtractBits (1) ==1) ? true 
// Extract AUX decode time stamp 
if ( bAUXTimeStampFlag ) { 

InputFrame . Advance2Next4BitPos ; 

nMSByte = ExtractBits ( 8 ) ; 

nMarker = ExtractBits (4) ; // nMaker==1011 
nLSByte28 = ExtractBits ( 2 8 ) ; 
nMarker = ExtractBits (4) ; // nMaker==1011 
nAUXTimeStamp = (nMSByte << 28) | nLSByte2 8; 

} 

// Extract AUX dynamic downmix flag 

bAUXDynamCoef f Flag = (ExtractBits (1) ==1) ? true : false; 
blnitDwnMixCoef f = false ; 
DeriveNumDwnMixCodeCoef f s ( ) 
{ 

unsigned int nPriCh, nNumCoeffs = ; 
nPriCh = anNumCh [ AMODE ] ; 
if (LFF > ) 
nPriCh++ ; 

// recall these tables do NOT include a scale row! 
// Check m_nPrmChDownMixType 
switch ( m_nPrmChDownMixType ) 
{ 

case DTSDOWNMIXTYPE_1_0 : 
nNumCoeffs = nPriCh; 
m_nNumChPrevHierChSet = 1 ; 
break ; 

case DTSDOWNMIXTYPE_LoRo : 
case DTSDOWNMIXTYPE_LtRt : 

nNumCoeffs = 2*nPriCh; 

m_nNumChPrevHierChSet = 2 ; 

break ; 

case DTSDOWNMIXTYPE_3_0 : 
case DTSDOWNMIXTYPE_2_l : 

nNumCoeffs = 3*nPriCh; 

m_nNumChPrevHierChSet = 3 ; 

break ; 

case DTSDOWNMIXTYPE_2_2 : 
case DTSDOWNMIXTYPE_3_l : 

nNumCoeffs = 4*nPriCh; 

m_nNumChPrevHierChSet = 4 ; 

break ; 
default : 

nNumCoeffs = 0; 

m_nNumChPrevHierChSet = ; 

break ; 

} 

return nNumCoeffs; 

} 

// Unpack the coefficients 
if ( bAUXDynamCoef f Flag ) 
{ 

// Extract the downmix type for primary ch 
nPrmChDownMixType = ExtractBits (3 ) ; 
// Extract the downmix code coeffs 

nNumDwnMixCodeCoef f s = DeriveNumDwnMixCodeCoef fs () ; 

} 

// Extract AUX dynamic downmix coeff codes 
if ( bAUXDynamCoef f Flag ) { 

if ( ! ReallocDwnMixCodeCoeff (nNumDwnMixCodeCoef fs) ) 

return false; 
for (n=0; n < nNumDwnMixCodeCoef fs ; n++) 
{ 

nTmp = ExtractBits (9) ; 
panDwnMixCodeCoef f s [n] = nTmp; 

} 

} for (nIndPrmCh=0 , n=0; nIndPrmCh<m_nNumChPrevHierChSet ; nIndPrmCh++) ■ 
for (nIndXCh=0; nIndXCh<nPriCh; nIndXCh++, n++) { 
nTmp = m_panDwnMixCodeCoef f s [n] ; 

// Extract and test the MSB ( NB ITS FORDM IXCOEFFWORD-bit words) 
// If 1 - > in phase (+1) ; if -> Out of phase (-1) 
nSign = ( nTmp & nMaskl ) ? 1 : -1; 
nTmp = (nTmp & nMask2) ; 
if (nTmp>0) { 

nTmp--; // -Infinity is not part of the table so decrement index 
if (nTmp>nTblSize) 
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Auxiliary data 


Size (bits) 


return false ; 
/ / convert 24- bit signed coef f s in Q15 to real 
m panCoreDwnMixCoef f s [n] = (nSign*DmixCoeff Table [nTmp] ) ; 

} 

else 

m_panCoreDwnMixCoef f s [n] = 0.0; 

} } 

ByteAlign = ExtractBits ( ... 7) ; 
nRev2AUXCRC16 = ExtractBits ( 16 ) ; 


0to7 
16 



Navigation to the start location of the auxiliary data is achieved by either reading the AUXCT variable and traversing to 
the next DWORD or by searching for the DWORD aligned AUX sync word 0x9Al 105A0 from the end of the audio 
frame. Since the data in the auxiliary may be required prior to unpacking the subframe data, the latter approach of 
searching for the AUX sync word is the suggested method. The auxiliary data may include a 36-bit time-stamp for 
decode synchronization and any dynamic downmix coefficients. 

nSYNCAUX (Auxiliary Sync Word) 

The auxiliary sync word is 0x9A1105A0 and is DWORD aligned. 
bAUXTimeStampFlag (Auxiliary Decode Time Stamp Flag) 
Indicates if a decode time stamp is present in auxiliary data. 
nAUXTimeStamp (Auxiliary Decode Time Stamp) 

Decode time stamp for synchronizing core audio with another audio stream such as a DTS substream. 
The timestamp data is a 36 bit field composed as follows: 

TimeStamp = (Hours x Mins x Sec x SampleRate) + SampleOffset 

where: 

Hours has range to 23, 
Mins has range to 59, 
Sec has range to 59, 

SampleRate may be 32 000, 44 100 or 48 000 as deduced from the SFREQ table (see Table 5-5), 

SampleOffset has range of to 3 1 999, 44 099 or 47 999. 

The timestamp of an encoded frame (AO corresponds to that time at the first edge of the first sample period at the start of 
an audio frame N entering into the encoder. The time stamp may also be thought of as a sample counter in which case a 
value of or other initial value corresponds to the beginning of the first sample period within the first audio frame as 
processed by the encoder 

bAUXDynamCoeffFlag (Auxiliary Dynamic Downmix Flag) 

If this flag is true, it indicates that the down mixing coefficients are included in the stream. 
nPrmChDownMixType (Auxiliary Primary Channel Downmix Type) 

Designates the dynamic downmix type for the primary channels when bAUXDynamCoeffFlag is true. The downmix 
type is defined by the parameters in Table 5-32. 
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Table 5-32: Downmix Channel Groups 



nPrmChDownMixType 


Downmix primary 


000 


1/0 


001 


Lo/Ro 


010 


Lt/Rt 


011 


3/0 


100 


2/1 


101 


2/2 


110 


3/1 


111 


Unused 



The downmix coefficients are packed as a NxM table of coefficients. To determine the total number of downmix 
coefficients packed in the stream, use the nPrmChDownMixType to determine the M (number of resultant downmix 
channels and the total number of channels encoded to determine N). 

panDwnMixCodeCoeffs (Dynamic Downmix Code Coefficients) 

Use the nPrmChDownMixType (designated number of fold down channels) and the number of primary channels + LFE 
channel (if included in audio frame) to determine the number of dynamic downmix code coefficients to extract. Each 
code coefficient is 9 bits. 

See clause D. 1 1 for table lookup to convert encoded downmix coefficient to actual coefficient values. 
ByteAlign for nAUXCRC16 (Pad to BYTE Boundary) 

This ensures that the nAUXCRC16 field that follows is aligned to a byte boundary to allow fast table-based CRC16 
calculation. Append '0's until the bit position is a multiple of 8. 

nAUXCRC16 (Auxiliary CRC-16 value) 

An auxiliary CRC-16 value is provided to verify both the detection of the auxiliary sync word and the contents of the 
auxiliary data. Traverse to next byte boundary and read out auxiliary CRC value. This CRC value is calculated for the 
auxiliary data from positions bAUXTimeStampFlag to the byte prior to the start of the CRC inclusive. 

Reserved (Reserved) 

This field is reserved for additional auxiliary information. The decoder shall assume that this field is present and of 
unspecified duration. Therefore in order to continue unpacking the stream, the decoder shall skip over this field using 
the auxiliary data start pointer and the auxiliary data byte size AUXCT. 

5.7.2 Rev2 Auxiliary Data Chunk 

The Rev2AUX data chunk contains broadcast metadata such as Dynamic Range Control (DRC) and dialog 
normalization (dialnorm). Navigation to the start location of the Rev2 auxiliary data chunk is achieved by searching for 
the DWORD-aligned nSYNCRev2AUX sync word (0x7004C070) by using one of the two following methods: 

• searching forward starting after all auxiliary data bytes AUXD are extracted; or 

• searching backward starting from the end of the audio frame. 

The detected sync word shall be verified by checking the CRC check sum nRev2AUXCRC16. Currently the Rev2 
auxiliary data may include an ES metadata flag and a corresponding down-mix attenuation level. In the future, 
additional metadata may be added in reserved field of the Rev2 Auxiliary Data Chunk. 

Padding for nSYNCRev2AUX (ZeroPadRev2Aux) 

The beginning of Rev2 Auxiliary Data Chunk is aligned on the 32-bit boundary from the beginning of the core stream 
(DWORD aligned). This is achieved by inserting a necessary number of zero bits prior to the nSYNCRev2AUX data. 
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5.7.2.1 Rev2 Auxiliary Data Chunk structure 

The Rev2 Auxiliary Data Chunk structure is shown in Table 5-33. 

Table 5-33: Rev2 Auxiliary Data Chunk Structure 



Rev2 Auxiliary Data Chunk Structure 



Size (Bits) 



nSYNCRev2AUX = ExtractBits ( 32 ) ; 
nRev2AUXDataByteSize = ExtractBits ( 7 ) + 1; 

if ( (nRev2AUXDataByteSize < 3) || (nRev2AUXDataByteSize > 128)) 
{ 

Error: Invalid range of Rev 2 Auxiliary Data Chunk Size 

} 

bESMetaDataFlag = (ExtractBits (1) 
if (bESMetaDataFlag==TRUE) 



1) ? TRUE : FALSE ; 



// Extract Embedded ES Downmix Scale Index 
nEmbESDownMixScalelndex = ExtractBits ( 8 ) ; 
// Check index range 

if {(nEmbESDownMixScalelndex < 40) || (nEmbESDownMixScalelndex >240)){ 
// Handle error: Invalid Index For a ES Downmix Scaling Parameter 
// Look up the scale factor 

ESDmixScale = DmixTable [nEmbESDownMixScalelndex] ; 

} 

if (nRev2AUXDataByteSize>4) 

bBroadcastMetadataPresent = (ExtractBits (1) == 1) ? TRUE : FALSE; 

else 

bBroadcastMetadataPresent = FALSE, - 
If (bBroadcastMetadataPresent == TRUE) 
{ 

bDRCMetadataPresent = (ExtractBits (1) == 1) ? TRUE : FALSE; 
bDialnormMetadata = (ExtractBits (1) == 1) ? TRUE : FALSE; 

if (bDRCMetaDataPresent == TRUE) { 

// Extract the DRC Version Number 
DRCversion_Rev2AUX = ExtractBits (4 ) ; 

} 

// Byte align to next field 
nByteAlignO = ExtractBits (...) ; 

// Extract the Rev2AUX DRC values, if present 

if (bDRCMetaDataPresent == TRUE) { 

// assumes DRCversion_Rev2AUX == 1: 

for (subSubFrame=0 ; subSubFrame < nSSC; subSubFrame++ ) 

subsubFrameDRC_Rev2AUX [subSubFrame] = dts_dynrng_to_db (ExtractBits (£ 

} 

// Extract DIALNORM_rev2aux, if present 
if (bDialnormMetadata == TRUE) { 

D I ALNORM_r ev2 aux = ExtractBits ( 5 ) ; 

DNG = - ( D I ALNORM_r ev2 Aux ) ; 

} 

} 

ReservedRev2Aux = ExtractBits (...) ; 
ByteAlignf orRev2AuxCRC = ExtractBits (...) ,- 
nRev2AUXCRC16 = ExtractBits ( 16 ) ; 



32 
7 



1 

8 



1 
1 
4 

1,...,7 
8xsubsubframes 



See description 
0,...,7 
16 



5.7.2.2 Description of Rev2 Auxiliary Data Chunk fields 

nSYNCRev2AUX (Rev2 Auxiliary Data Chunk Sync Word) 

The DWORD-aligned Rev2 Auxiliary Data Chunk synchronization word has the value 0x7004C070. 
nRev2AUXDataByteSize (Rev2 Auxiliary Data Byte Size) 

The nRev2AUXDataByteSize is equal to the size of the Rev2 Auxiliary Data Chunk in bytes from the 
nRev2AUXDataByteSize to nRev2AUXCRC16 inclusive. This marker also designates the end of the field 
nRev2AUXCRC16 and allows quick location of the checksum at byte position nRev2AUXDataByteSize - 2 offset from 
the nRev2AUXDataByteSize inclusive. The nRev2AUXDataByteSize is an unsigned integer with a valid range 
between 3 and 128 inclusive. 
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bESMetaDataFlag (ES Metadata Flag) 

When the bESMetaDataFlag is TRUE, metadata related to the embedded down-mix from an extended surround (ES) 
layout is present in the stream, namely nEmbESDownMixScalelndex which is an index into DmixTable[ ]. When the 
bESMetaDataFlag is TRUE, it informs the core decoder that the channels encoded in the core stream represent a 
down-mix from some extended surround (ES) layout with >5.1 channels. If the bESMetaDataFlag is FALSE or 
bESMetaDataFlag is not encoded in the stream (i.e. Rev2 Auxiliary Data Chunk), then either no embedded ES 
downmix metadata exists or should not be used. 

nEmbESDownMixScalelndex (Embedded ES Downmix Scale Index) 

This field is encoded in the stream only if the bESMetaDataFlag is TRUE. It corresponds to the amount of attenuation 
that is applied to the core encoded channels during the process of embedded ES down-mixing. This information is only 
needed when the core decoded audio represents primary audio that is going to be mixed with the secondary audio. In 
this case, the mixer needs to obtain this attenuation information from the core decoder in order to adjust the level of 
secondary audio prior to mixing. The core decoder may choose to combine this scaling with the scaling required by 
dialog normalization parameter. 

The encoded parameter nEmbESDownMixScalelndex represents an 8-bit index into the DmixTable[ ], which is a scale 
factors look-up table listed in clause D. 1 1 . Although the ESDmixScale parameters are obtained from the DmixTable[ ], 
their range is limited on the encode side to [-40 dB, dB]. This corresponds to the valid range for 
nEmbESDownMixScalelndex to be between 40 and 240 inclusive. 

The entries in DmixTablef ] are unsigned 16-bit integer numbers representing the numbers in column Abs Values [] 
(from the same table), after multiplication by 2 15 and rounding to the nearest integer value. The ESDmixScale values 
are unsigned integer numbers obtained from the entries of DmixTablef ]. 

To obtain the actual ES down-mix scale factor (ESDmixScale), the logic shown in Table 5-33 shall be followed. 
bBroadcastMetaDataPresent (Broadcast Metadata Present Flag) 

When the bBroadcastMetaDataPresent flag is TRUE, metadata related to the application of DRC and Dialnorm are both 
present in the Rev2AUX chunk. Therefore, bDRCMetaDataPresent and bDialnormMetadataPresent flags shall be 
checked and any associated metadata shall be applied to the core stream channels. 

Presence of the bBroadcastMetaDataPresent flag and the associated metadata may NOT be guaranteed even when the 
Rev2AUX data chunk is present in the stream. Decoders that use this metadata SHALL determine the presence of the 
bBroadcastMetaDataPresent flag based on the nRev2AUXDataByteSize. In particular the bBroadcastMetaDataPresent 
flag is present in the stream if and only if the nRev2AUXDataByteSize > 4. 

bDRCMetaDataPresent (DRC Metadata Present Flag) 

This flag will be present if bBroadcastMetaDataPresent flag is TRUE. When the bDRCMetaDataPresent flag is TRUE, 
metadata related to the application of DRC will be present in the Rev2AUX chunk and the DRC values in the 
Rev2AUX data chunk should be used instead of any dynamic range control coefficients found in the legacy core stream 
(indicated by flag DYNF ) . In addition, channels encoded in the core stream should apply the subsequent DRC values to 
subsubframes. If the bBroadcastMetaDataPresent flag is FALSE, no subsubframe DRC values are present in the 
Rev2AUX data chunk. 

bDialnormMetaDataPresent (Dialnorm Metadata Present Flag) 

This flag will be present if bBroadcastMetaDataPresent flag is TRUE. When the bDialnormMetaDataPresent flag is 
TRUE, metadata related to the application of dialnorm will be present in the Rev2AUX chunk and the dialog 
normalization values in the Rev2AUX data chunk should be used instead of the DIALNORM field found in the legacy 
core stream. In addition, the Rev2AUX dialnorm value should be smoothed and applied to the channels encoded in the 
core stream. If the bBroadcastMetaDataPresent flag is FALSE, no dialnorm values are present in the Rev2Aux data 
chunk. 
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DRCversion_Rev2AUX (DRC Version) 

This field will be present only if bBroadcastMetaDataPresent flag is TRUE and if bDRCMetaDataPresent flag is TRUE. 
DRCversion_Rev2AUX is a four bit field which is used to determine the version of DRC algorithm which the encoder 
used. The first version starts at 0x1. Decoders will support DRC version 0x1 up to the latest version which they support. 
Currently only DRCversion_Rev2AUX = 1 is supported in the Rev2Aux chunk. If the encoder is supplying DRC 
information with a version number higher than that which is supported by the decoder, the supplied Rev2Aux DRC 
values should be ignored and no DRC should be applied. 

nByteAlignO (ByteAlignvaries ) 

This 0-bit padding ensures that the field that follows is aligned to a byte boundary. Append '0' bits until the bit position 
is a multiple of 8. This field will be 1 bit if the DRCversion_Rev2AUX field was present and will be 5 bits if the 
DRCversion_Rev2AUX field was not present. 

subsubFrameDRC_Rev2AUX[ ] (DRC Values) 

This field will be present only if bBroadcastMetaDataPresent flag is TRUE and if bDRCMetaDataPresent flag is TRUE. 
Each subsubFrameDRC_Rev2AUX[ ] field falls on a byte boundary. Currently only DRC version 1 is supported, which 
is single band mode. In single band mode, one 8 bit value is transmitted for each subsubframe, as detailed in Table 5-34. 

Table 5-34: Number of Rev2AUX DRC bytes transmitted per frame length 



Frame Length 


DRC bytes (1 per subsubframe) 


512 


2 


1 024 


4 


2 048 


8 



In the CA stream, each subsubframe is 256 samples in length, corresponding to 5,33 ms at 48 000 samples per second 
(5,33 ms = subsubframe length in sample/ sampling rate). Table 5-35 shows the relationship between the frame sizes 
and number of bits used for DRC values: 



Table 5-35: Rev2AUX DRC bits per frame 



Frame 
length in 
samples 


Number of 
subsubframes 
per frame 


Frames/sec 


No. of bits 
used for 
DRC values 
per frame 


Bits/sec 
for DRC 
values 
only 


DRC extension 
overhead 
(bits/frame) 


DRC extension 
total not 

including zero 
padding 
(bits/frame) 


DRC 
extension 
total not 
including 
zero padding 
(bits/sec) 


512 


2 


9 375 


16 


1 500 


10 


26 


2 438 


1 024 


4 


46 875 


32 


1 500 


10 


42 


1 969 


2 048 


8 


234 375 


64 


1 500 


10 


74 


1 808 



Each DRC byte value is extracted from the bitstream and converted into a dB gain by function 
dts_dynrng_to_db ( ) . 

DIALNORM_rev2aux (Dialog Normalization Parameter) 

The DIALNORM_rev2aux field will be present only if bBroadcastMetaDataPresent flag is TRUE and if 
bDialnormMetaDataPresent flag is TRUE. Field DIALNORM_rev2aux falls on a byte boundary. If the encoded stream 
contains both a DIALNORM field and a DIALNORM_rev2aux field, DIALNORM_rev2aux takes priority. 
DIALNORM_rev2aux is a 5-bit field which is used to determine the dialog normalization parameter. 

The dialog normalization gain (DNG), in dB, is specified by the encoder operator and is used to directly scale the 
decoder output samples. In the DTS stream, the information about the DNG value is transmitted as described in 
Table 5-36. 
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Table 5-36: Rev2AUX Dialog Normalization Parameter 



Dialog Normalization Gain (DNG) 
Applied to the Decoder Outputs 
|ao j 


DIALNORM_rev2aux 
(binary) 


DIALNORM_rev2aux 
(unsigned int) 


u 


uuuuuuu 


u 


i 


UUUUUU I 


i 


_o 

c. 


nhnnm n 

UUUUU I u 


o 


o 


nhnnm 1 


o 


-4 


nhnm nn 

UUUU I uu 


A 


o 


nhnm m 

UUUU I U I 


o 


-fi 


nhnm 1 n 

UUUU I I u 


c 
o 


.7 


nhnm 1 1 

UUUU I I I 


7 


-ft 
o 


nhm nnn 

UUU I uuu 


Q 

o 




nhm nm 

UUU I UU I 


q 


-1 n 

I u 


nhm m n 

UUU I U I u 


1 n 

I u 


-1 1 
i i 


nhm m 1 

UUU IUII 


i i 


-1 9 
I 


nhm 1 nn 

UUU I I uu 


1 9 
1 c. 


1 o 


nhm 1 m 

UUU I I U I 


1 T 


1 *r 


nhm 1 1 n 

UUU I I I u 


1 A 

1 H 


-1 R 
I O 


nhm 111 

UUU 1 1 I I 


1 R 

I o 


-1 fi 
I D 


nhi nnnn 

UU I UUUU 


1 R 

I D 


-17 
I / 


nhi nnm 

UU I UUU I 


1 7 
I / 


-1 ft 
I o 


nhi nm n 

UU I UU I u 


1 ft 
I o 


-1 Q 
I y 


nhi nm 1 

UU I UU I I 


1 Q 
i y 


-9n 
*1U 


nhi m nn 

UU I U I UU 


9n 

^1U 


-91 
I 


nhi m m 

UU I U I U I 


91 

c. 1 


-99 


nhi m 1 n 

UU I U I I u 


99 


9^ 


nhi m 1 1 

UU I U I I I 


9T 


-24 


nhi innn 

\JU 1 1 UUU 


24 


-25 


0b1 1001 


25 


-26 


0b1 1010 


26 


-27 


0b1 101 1 


27 


-28 


obmoo 


28 


-29 


0b1 1 101 


29 


-30 


0b1 1110 


30 


-31 


0b1 1111 


31 



ReservedRev2Aux (Reserved bits) 

This field is reserved for additional metadata information that may be added in the future. The decoder shall assume that 
this field is present and of unspecified duration. Therefore, in order to continue unpacking the stream, the decoder shall 
skip over this field skipping nRev2AUXDataByteSize bytes from the bESMetaDataFlag inclusive. 

ByteAlignforRev2AuxCRC (Pad for nRev2AUXCRC16) 

This zero-padding field ensures that the nRev2AUXCRC16 field that follows is aligned to a byte boundary to allow fast 
table-based CRC16 calculation. Append '0' bits until the bit position is a multiple of 8. 

nRev2AUXCRC16 (Rev2 Auxiliary CRC-16 value) 

A Rev2 auxiliary CRC-16 value is provided to verify both the detection of the Rev2 auxiliary sync word and the 
contents of Rev2 Auxiliary Data Chunk. To locate the position of the nRev2AUXCRC16 data field, start from the 
beginning position of field nRev2AUXDataByteSize and jump forward (nRev2AUXDataByteSize - 2) bytes. This CRC 
value is calculated for the Rev2 Auxiliary Data Chunk from the position of nRev2AUXDataByteSize to the 
ByteAlignforRev2AuxCRC, inclusive. 
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6 Core Extensions 

The generalized concept of core + extension coding is well established in the context of DTS encoding. The present 
document describes the extensions (components) found in any DTS stream. These components include: 

• XCH - Extra centre surround (Cs) channel with 6. l->5. 1 down-mix embedded in the core stream using default 
down-mix coefficients (Ls dra = Ls+0,7071Cs; Rs dm = Rs+0,7071Cs). 

• X96 - High frequency components introduced by higher sampling rates (88,2/96 kHz). 

• XBR - Extended resolution for the channels encoded in the core sub-stream (requires bit-rates > 1,5 Mbps but 
guarantees backward compatibility of a 1,5 Mbps core sub-stream). 

• XXCH - Extra channels beyond 5.1. 

Some of these components may exist in either the core sub-stream or the extension sub-stream. The extension 
substream is defined in clause 7. 

6.1 X96 Extension 

The generalized concept of core+96 kHz -extension coding is illustrated in Figure 6-1. To encode 96 kHz LPCM, the 
input audio stream is fed to a 96 to 48 kHz down sampler and the resulting 48 kHz signal is encoded using a standard 
core encoder as in Figure 6-1, Section A, as follows: 

• In the "Preprocess Input Audio" block, the original 96 kHz/24-bit LPCM audio is first delayed and then passed 
through the extension 64-band analysis filter bank. Signal "1" in this case consists of the extension subband 
samples at 96 kHz/64. 

• The core data consists of the core audio codes in 32 subbands. In the "Reconstruct Core Audio Components" 
block, the core audio codes are inversely quantized to produce the reconstructed core subband samples at 
48 kHz/32. These subband samples correspond to signal "2". 

• In the "Generate Residuals" block, the reconstructed core subband samples are subtracted from the extension 
subband samples in the lower 32 subbands. The extension subband samples in the upper 32 bands remain 
unaltered. These residual subband samples in the 64 bands correspond to signal "3". 

• The "Generate Extension Data" block processes the residual subband samples and generates the extension data 
that, along with the core data, is assembled in a packer to produce a core + extension bitstream. 

In the 96 kHz decoder, as in Figure 6-1, Section B, the unpacker first separates the core + extension stream into the core 
and extension data. The core subband decoder, in the Reconstruct Core Audio Components block, processes the core 
data and produces the reconstructed core subband samples (same as signal "2" generated in the encoder). Next, in the 
Reconstruct Residual Components block, the extension subband decoder uses the extension data to generate the 
reconstructed residual subband samples in the 64 bands. In the Recombine Core and Residual Components block the 
core subband samples are added to the lower 32 bands of residual subband samples to produce the extension subband 
samples in the 64 bands. In the same block, the synthesis 64-band filter bank processes the extension subband samples 
and generates the 96 kHz 24-bit LPCM audio. The combining of reconstructed residuals and core signals on the decoder 
side, as in Figure 6-1, Section B, is also done in subband domain. 
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C) 48 kHz (Legacy) Decoder 



Figure 6-1 : The Concept of Core + Extension Coding Methodology 

When a 48 kHz-only (legacy) decoder is fed the core + extension bitstream, as in Figure 6-1, Section C, the extension 
data fields are ignored and only the core data is decoded. This results in 48 kHz core LPCM audio output. 

6.1 .1 DTS Core + 96 kHz-Extension Encoder 

The block diagram in Figure 6-2 shows the main components of the encoding algorithm. The input digital audio signal 
with a sampling frequency up to 96 kHz and a word length up to 24 bits is processed in the core branch and extension 
branch. In the core branch input audio is low-pass filtered (LPF) to reduce its bandwidth to below 24 kHz and then 
decimated by a factor of two, resulting in a 48 kHz sampled audio signal. The purpose of this LPF decimation is to 
remove signal components that cannot be represented by the core algorithm. The down sampled audio signal is 
processed in a 32-band analysis cosine modulated filter bank (QMF) that produces the core subband samples. The core 
bit allocation routine based on the energy contained in each of the subbands and configuration of the core encoder 
determines the desired quantization scheme for each of the subbands. The core subband encoder performs quantization 
and encoding after which the audio codes and side information are delivered to the packer. The packer assembles this 
data into a core bitstream. The X96 extension can either be an extension of the core audio frame or included with a DTS 
substream. 

In the extension branch the delayed version of input audio is processed in a 64-band analysis cosine modulated filter 
bank (QMF) that produces the extension subband samples. Inverse quantization of the core audio codes produces the 
reconstructed core subband samples. Subtracting these samples from the extension subband samples in the lower 
32 bands generates the residual subband samples. The residual signals in the upper 32 subbands are unaltered extension 
subband samples in corresponding bands. The delay of input audio is such that reconstructed core subband samples and 
extension subband samples in the lower 32 bands are time-aligned before the residual signals are produced i.e.: 

Delay = Delay DecimationLPF + Delay Core Q MF - Delay Extension Q MF . 

The extension bit allocation routine based on the energy of residuals in each of the subbands and configuration of the 
extension encoder determines the desired quantization scheme for each of 64 subbands. The residual samples in 
subbands are encoded using a multitude of adaptive prediction, scalar/vector quantization and/or Huffman coding to 
produce the residual codes and extension side information. The packer assembles this data into an extension bitstream. 
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Figure 6-2: The Block Diagram of DTS Core + Extension Encoder 



6.1 .2 DTS Core + 96 kHz Extension Decoder 

On the decoder side, the core and extension parts of the encoded bitstream are fed to their respective subband decoders. 
The reconstructed core subband samples are added to the corresponding residual subband samples in lower 32 bands. 
The reconstructed residual subband samples in the upper 32 bands remain unaltered. Passing the resulting extension 
subband samples through the synthesis 64-band QMF filter bank produces the 96 kHz sampled PCM audio. Figure 6-3 
shows the block diagram of the core + extension decoder. 

If the encoded bit-stream does not contain the extension data, the decoder, based on its hardware configuration, uses: 

• a 32-band QMF with core subband samples as inputs to synthesize the 48 kHz sampled PCM audio; 

• a 64-band QMF with inputs being core subband samples in the lower 32 bands and "zero" samples in the upper 
32 bands to synthesize the interpolated PCM audio sampled at 96 kHz. 

The existing DTS core decoders when receiving the core + extension bitstream will extract and decode the core data to 
produce the 48 kHz sampled PCM audio. The decoder ignores the extension data by skipping the extraction until the 
next DTS synchronization word. 
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Figure 6-3: The Block Diagram of DTS Core + extension Decoder 



6.1 .3 Extension (X96) Bitstream Components 

The X96 extension may be a part of the core substream (denoted by the DTS_BCCORE_X96) or a part of the extension 
substream (denoted by the DTS_EXSUB_STREAM_X96). During the synchronization procedure, the decoder will 
determine whether the DTS_BCCORE_X96 or DTS_EXSUB_STREAM_X96 is being decoded. 

The frame of X96 data is divided into a frame header and up to four channel sets. The frame header structure is different 
for the case of DTS_BCCORE_X96 extension and the case of DTS_EXSUB_STREAM_X96 extension. In case of the 
DTS_BCCORE_X96 only one channel set may exist in the X96 stream. 

Each channel set has its own channel set header as shown in Figure 6-4. In the case of DTS_EXSUB_STREAM_X96, 
the Cycle Redundancy Check (CRC) word is included at the end of frame header to allow detection of errors in the 
frame header data. In addition, the CRC words may be included at the end of each channel set header to allow detection 
of errors in the channel set header data. 




Figure 6-4: X96 data frame structure 
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The channel set data is subdivided into the subframes. Each subframe consists of a subframe header and the audio data. 
The audio data is organized in subsubframes as shown in Figure 6-5. The number of subframes is the same for all 
channel sets and is equal to the number of subframes in the core frame. Similarly, the number of subsubframes is the 
same for all subframes of all channel sets and is equal to the number of subsubframes within each subframe of the core 
frame. In other words, the subframe and subsubframe partitioning within the X96 frame follows the partitioning present 
in the core frame. 
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Figure 6-5: Channel Set Data Structure 



6.1 .3.1 DTS_BCCORE_X96 Frame Header 

This clause describes the X96 extension when it is present in the core extension of the core extension substream. 

Table 6-1 : DTS BCCORE X96 Frame Header Structure 



DTS_ BCCORE X96 Frame Header Syntax 


Size (Bits) 


SYNC96= ExtractBits(32); 


32 


FSIZE96 = ExtractBits(12)+1; 


12 


REVNO = ExtractBits(4); 


4 



SYNC96 (DTS_BCCORE_X96 Extension Sync Word) 

The synchronization word SYNC96 = 0xlD95F262 for the DTS_BCCORE_X96 extension data comes after the core 
audio data within the core substream. For 16-bitstreams the sync word is aligned to 32-bit word boundary. In the case of 
14-bitstreams SYNC96 is aligned to both 32-bit and 28-bit word boundaries meaning that 28 MSBs of the SYNC96 
appear as 0x0765 1F26. 

To reduce the probability of false synchronization caused by the presence of pseudo sync words, it is imperative to 
check the distance between the detected sync word and the end of current frame (as indicated by FSIZE). This distance 
in bytes shall match the value of FSIZE96 (see below). 

After the synchronization is established a flag nX96Present is set and the output sampling frequency is selected as: 

OutSamplingFreq = SFREQ; 

if ( nX96Present) 
OutSamplingFreq = 2*OutSamplingFreq; 

NOTE: SFREQ corresponds to a sampling frequency of reconstructed audio in the core decoder. 
FSIZE96 (DTS_BCCORE_X96 Extension Frame Byte Data Size) 

FSIZE96 is the byte size of DTS_ BCCORE _X96 extension data. Valid range for FSIZE96: 96 - 4 096. 
REVNO (Revision Number) 

The revision number for the high-frequency extension processing algorithm. 



ETSI 



47 



ETSI TS 102 114 V1. 4.1 (2012-09) 



Table 6-2: REVNO 



REVNO 


Frequency Extension Encoder Software Revision Number 





Reserved 


1 


Currently in use (compatible with the Rev1 .0 specification) 


2 - 7 


Future revision (compatible with the Rev1 .0 specification) 


8 


Currently in use (incompatible with the original Rev1 .0 specification) 


9- 15 


Future revision (incompatible with the original Rev1 .0 specification) 



Decoders designed using this Rev3.0 specification shall be able to decode all streams when REVNO < 9. If decoder is 
not compatible with some algorithm revisions (REVNO > 8), it shall ignore the DTS_ BCCORE_X96 extension stream. 

6.1 .3.2 DTS_EXSUB_STREAM_X96 Frame Header 

This clause describes the X96 extension when it is present in the extension substream of the DTS-HD stream. 



Table 6-3: DTS_EXSUB_STREAM_X96 Frame Header Structure 



DTS_EXSUB_STREAM_X96 Frame Header Syntax 


Size (Bits) 


SYNCX96 = ExtractBits (32) ; 


32 


nHeaderSizeX96 = ExtractBits ( 6 ) +1 ; 


6 


REVNO = ExtractBits (4) ; 


4 


bCRCPresent4ChSetHeaderX96 = ExtractBits ( 1 ) ; 


1 


nNumChSetsInX96 = ExtractBits ( 2 ) +1 ; 


2 


for (nChSet=0; nChSet < nNumChSetsInX96 ; nChSet ++) 


12 


pnuChSetFsizeX96 [nChSet] = ExtractBits { 12 ) +1 ; 


for (nChSet=0; nChSet < nNumChSetsInX96 ; nChSet ++) 




nuChInChSetX96 [nChSet] = ExtractBits ( 3 ) + 1; 


3 


ReservedHeaderX9 6 = ExtractBits (...) ; 




ByteAlignHeaderX96 = ExtractBits ( ... 7) ; 


0...7 


nCRC16HeaderX96 = ExtractBits ( 16 ) ; 


16 



SYNCX96 (DTS_EXSUB_STREAM_X96 Sync Word) 

The DWORD aligned DTS_ EXSUB_STREAM _X96 synchronization word has a value SYNCX96 = (MD95F262. 
During sync detection, the nCRC16HeaderX96 checksum is used to further verify that the detected sync pattern is not a 
random alias. After the synchronization is established, the flag (nX96Present) is set and the output sampling frequency 
is selected as: 

OutSamplingFreq = SFREQ; 
if ( nX96Present) 

NOTE: SFREQ corresponds to a sampling frequency of reconstructed audio in the core decoder. 
nHeaderSizeX96 (DTS_EXSUB_STREAM_X96 frame header length) 

The size of the header in bytes from SYNCX96 to nCRC16HeaderX96 inclusive. This value determines the location of 
the first channel set header. This marker also designates the end of the field, nCRC16HeaderX96 and allows quick 
location of the checksum at byte position nHeaderSizeX96-2. 

REVNO (Revision Number) 

The revision number for the high frequency extension processing algorithm has the same coding and interpretation as 
described in Table 6-2. 

bCRCPresent4ChSetHeaderX96 (CRC presence flag for channel set header) 

When bCRCPresent4ChSetHeaderX96 = true the 16-bit CRC word for the channel set header is present at the end of 
each channel set header. For the case of DTS_BCCORE_X96 the default value for the bCRCPresent4ChSetHeaderX96 
is false. 
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nNumChSetsInX96 (Number of Channel Sets) 

All channels within the DTS_EXSUB_STREAM_X96 extension are organized into individually decodable channel 
sets. The nNumChSetaInX96 is the number of channel sets that are present in X96 component. 

pnuChSetFsizeX9 (Channel Set Data Byte Size) 

The pnuChSetFsizeX96[nChSet] indicates the total number of data bytes in each nChSet channel set of the 
DTS_EXSUB_STREAM_X96 frame. Starting from the SYNCX96 and using the cumulative sum of nHeaderSizeX96 
and pnuChSetFsizeX96[k] (over all channel sets k=0, ... nChSet -1) as offset, the decoder may traverse to the beginning 
of channel set header data in the channel set nChSet. 

nuChInChSetX9 (Number of channels in a channel set) 

Indicates the number of channels in the channel set. 
ReservedHeaderX96 (Reserved) 

This field is reserved for additional DTS_EXSUB_STREAM_X96 header information. The decoder shall assume that 
this field is present and of unspecified length. Therefore, in order to continue unpacking the stream, the decoder shall 
skip over this field using the DTS_EXSUB_STREAM_X96 header start pointer and the DTS_EXSUB_STREAM_X96 
header size nHeaderSizeX96. 

ByteAlignHeaderX96 (Pad to BYTE boundary) 

This field ensures that the CRC16 field that follows is aligned to a byte boundary to allow fast table based CRC16 
calculation. Append '0's until bit position is a multiple of 8. 

nCRC16HeaderX96 (CRC16 of X96 frame header) 

This field represents the 16-bit CRC check word of the entire DTS_EXSUB_STREAM_X96 header from position 
nHeaderSizeX96 to ByteAlignHeaderX96 inclusive. 

6.1 .3.3 X96 Channel Set Header 



Table 6-4: X96 Channel Set Header Structure 



X96 Channel Set Header Syntax 


Size (Bits) 


if { m_nuCoreExtensionMask & DTS_EXSUB_STREAM_X96 ) 


7 


nuChSetHeaderSizeX96 = ExtractBits ( 7 ) +1 ; 




HIGHRES-FLAG96K = ExtractBits(l); 


1 


if <REVNO<8) { 




nSBS96 = ExtractBits (5) ; 




if <nSBS96<0 | | nSBS96>27) 




return SYNC ERROR DEF; 

} 


5 


else 




nSBS96=32; 




// For DTS_EXSUB_STREAM_X96 : nPCHS=nuChInChSetX96 [ChSet] ; 




// For DTS BCCore X96 nPCHS is obtained from the core 




for (ch=0; ch<nPCHS; ch++) { 




SBE96 [ch] = ExtractBits (6) ; 


6 


anSBE96 [ch] = SBE96 [ch] + 1 ; 




} 

for (ch=0; ch<nPCHS; ch++) 




JOINX96 [ch] = ExtractBits (3) ; 


3 


for (ch=0; ch<nPCHS; ch++) 




SHUFF96 [ch] = ExtractBits ( 3 ) ; 


3 


for <ch=0; ch<nPCHS; ch++) 




BHUFF96 [ch] = ExtractBits ( 3 ) ; 


3 
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X96 Channel Set Header Syntax 


Size (Bits) 


// ABITS96=2: 
n=0; 

for <ch=0; ch<nPCHS; ch++) 

SEL96 [ch] [n] = ExtractBits ( 1 ) ; 

// ABITS96 = 3 to 6 : 

for (n=l; n<5; n++) { 

for <ch=0; ch<nPCHS; ch++) 

SEL96 [ch] [n] = ExtractBits (2 ) ; 




} 

// ABITS96 = 7 

for (ch=0; ch<nPCHS; ch++) 

SEL96 [ch] [n] = ExtractBits { 3 ) ; 

if (HIGHRESFLAG96K == 1) { 
// ABITS96 = 8 to 11: 
for (n=5; n<10; n++) { 

for (ch=0; ch<nPCHS; ch++) 

SEL96 [ch] [n] = ExtractBits ( 3 ) ; 


Variable bits 


} 

// ABITS96 = 12 to 15: 
for (n=10; n<16; n++) { 

for (ch=0; ch<nPCHS; ch++) 

SEL96 [ch] [n] =0; // Not transmitted, set to 

} 




} 

if { m nuCoreExtensionMask & DTS EXSUB STREAM X96 ) { 
ReservedHeaderChSet = ExtractBits (...) ; 
ByteAlignHeaderChSet = ExtractBits { ... 7); 
If <bCRCPresent4ChSetHeaderX9S==true) 

nCRC16CheSetHeader = ExtractBits { 16 ) ; 

else{ 

if (CPF==1) 

AHCRC96 = ExtractBits (16) ; 

} 


0...7 
16 

16 


NuChSetHeaderSizeX96 (Channel set header length) 



This field is present only for the case of DTS_EXSUB_STREAM_X96 stream. The size of the channel set header in 
bytes from the nuChSetHeaderSizeX96 to either ByteAlignHeaderChSet (when bCRCPresent4ChSetHeaderX96 = 
false) or nCRC16HeaderChSet (when bCRCPresent4ChSetHeaderX96 = true), inclusive. This value determines the 
beginning of the channel set audio data. If the bCRCPresent4ChSetHeaderX96 = true, this marker also designates the 
end of the field nCRC16HeaderChSet and allows quick location of the checksum at byte position 
nuChSetHeaderSizeX96 - 2. 

HIGHRESFLAG96K (High Resolution Flag) 

The high resolution flag determines the upper limit on ABITS96 and consequently, the maximum number of 
quantization levels for the audio samples as described in Table 6-5. 



Table 6-5: High Resolution Flag 



HIGHRESFLAG96K 


ABIT 





0-7 


1 


- 15 



nSBS96 (First encoded subband transmitted only if REVNO<8) 

This field indicates the first active extension subband in each primary channel. Extension subband samples in subbands 
[0 : (nSBS96-l)] are assumed to be zeros. 

SBE96 and anSBE96 (Last encoded subband) 

This field indicates the last subband in the audio channel, ch, that is encoded without the use of joint intensity coding 
(31 < SBE96[ch] < 63). When the joint intensity coding in the audio channel, ch, is disabled, the extension subband 
samples in subbands above SBE96[ch] are assumed to be zeros (anSBE96 is used in the extension substream 
processing). 
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JOINX96 (Joint Intensity Coding Index) 

This field indicates if joint intensity coding is enabled for channel, ch and which audio channel is the source channel 
from which channel ch will copy subband samples. The construction of JOINX96 is done according to Table 5-22. 

SHUFF96 (Scale Factor Encoder Select) 

The scale factors of a channel are quantized nonlinearly using 6-bit (64-level, 2,2 dB per step) square root table. The 
quantization indices may be further compressed by one of the five Huffman codes (129 levels) and this information is 
transmitted to the decoder by SHUFF96[ch]. Scale factors are differentially encoded prior to the Huffman encoding. 



Table 6-6: Scale Factor Encoder Select SHUFF96 



SHUFF96 


Code Book 


Square-root Table 





SA129 


6 bit (clause D.1.1) 


1 


SB129 


6 bit (clause D.1.1) 


2 


SC129 


6 bit (clause D.1.1) 


3 


SD129 


6 bit (clause D.1.1) 


4 


SE129 


6 bit (clause D.1.1) 


5 


linear 


6 bit (clause D.1.1) 



BHUFF96 (Bit Allocation Quantizer Select) 

This field indicates which codebook should be used to encode the bit allocation index ABITS96. The bit allocation 
indices may be further compressed by one of the seven Huffman codes. This information is transmitted to the decoder 
by BHUFF96[ch]. When Huffman encoding is used the bit allocation indices are first differentially encoded. 



Table 6-7: Bit Allocation Encoder Select BHUFF96 



BHUFF96 


Code Book for HIGHRESFLAG96K=0 


Code Book for HIGHRESFLAG96K=1 





A17 


A33 


1 


B17 


B33 


2 


C17 


C33 


l 3 


D17 


D33 


4 


E17 


E33 


5 


F17 


F33 


L 6 


G17 


G33 


7 


3-bit linear 


4-bit linear 



SEL96 (Quantization Index Codebook Select) 

After subband samples are quantized using a mid-tread linear quantizer, the quantization indices are further encoded 
using entropy (Huffman) coding in order to reduce the bit rate. For channel ch, the selection of a particular codebook 
for a mid-tread linear quantizer indexed by ABITS96[ch] is transmitted to the decoder as SEL96[ch][ABITS96[ch]-2]. 
Table 6-8 depicts the quantization index codebook values. 
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Table 6-8: Quantization Index Codebook Select SEL96 



ARIT 

/-\DI 1 


Oi ianti7atir»n Tuno 
uucii iu£ciuui i i ypc 


if hitc for 
if mis iui 

SEL 

>JLL 


Quantization Index Code-book Select SEL 




| 1 | 2 | 3 | 4 | 5 | 6 | 7 





No Bits Allocated 





Not Transmitted 


1 


16 element VQ 





VQ16 
















2 


3-level SQ 


1 


A3 


V3 














3 


5-level SQ 


2 


A5 


B5 


C5 


V5 










4 


7-level SQ 


2 


A7 


B7 


C7 


V7 










5 


9-level SQ 


2 


A9 


B9 


C9 


V9 










6 


1 3-level SQ 


2 


A13 


B13 


C13 


V13 










7 


1 7-level SQ 


3 


A17 


B17 


C17 


D17 


E17 


F17 


G17 


V17 


In case of HIGHRESFLAG96K = 1 ABIT takes values from to 1 5 


8 


25-level SQ 


3 


A25 


B25 


C25 


D25 


E25 


F25 


G25 


V25 


9 


33 or 32 level SQ 


3 


A33 


B33 


C33 


D33 


E33 


F33 


G33 


NFE 


10 


65 or 64 level SQ 


3 


A65 


B65 


C65 


D65 


E65 


F65 


G65 


NFE 


11 


129 or 128 level SQ 


3 


A129 


B129 


C129 


D129 


E129 


F129 


G129 


NFE 


12 


256-level SQ 





NFE 
















13 


512-level SQ 





NFE 
















14 


1 024-level SQ 





NFE 
















15 


2 048-level SQ 





NFE 
















NOTE: NFE = No further encoding is used to encode the linearly quantized subband samples. A,B,C,D,E,F,G = 

Subband samples are encoded by Huffman code. V = 4 subband samples are grouped and encoded using 4- 
element block code. 



ReservedHeaderChSet (Reserved) 

This field is present only for the case of DTS_EXSUB_STREAM_X96 stream. This field is reserved for additional 
channel set header information. The decoder shall assume that this field is present and of unspecified duration. 
Therefore in order to continue unpacking the stream, the decoder shall skip over this field using the channel set header 
start pointer and the channel set header size nuChSetHeaderSizeX96. 

ByteAlignHeaderChSet (Pad to BYTE boundary) 

This field is present only for the case of the DTS_EXSUB_STREAM_X96 stream. This field ensures that the CRC16 
field that follows is aligned to a byte boundary to allow fast table based CRC16 calculation. Append '0's until bit 
position is a multiple of 8. 

nCRC16CheSetHeader (Channel Set Header CRC Check Word) 

This field is present only for the case of the DTS_EXSUB_STREAM_X96 stream and only when the 
bCRCPresent4ChSetHeaderX96 is true. It checks if there is any error in the bitstream from the beginning of channel set 
header data up to this point. 

AHCRC96 (Audio Header CRC Check Word) 

This field is present only for the case of the DTS_ BCCORE _X96 and only when the core parameter CPF is 1 . If this 
field is present, the value shall be extracted from the bitstream. This value will not be used. 

Unpack 96 kHz Extension Subframes 



Table 6-9: X96 Channel Subframe Processing 



X96 Extension Subframes 


Size (Bits) 


for (ch=0; ch<nPCHS; ch++) { 

for (n=nSBS96; n<anSBE96 [ch] ; n++){ 
PMODE96 [ch] [n] = ExtractBits ( 1 ) ; 

} 

} 

for (ch=0; ch<nPCHS; ch++) { 

for (n=nSBS96; n<anSBE96 [ch] ; n++){ 
if { PMODE96 [ch] [n] >0 ) { 
// Extract the PVQs 

nVQIndex = ExtractBits ( 12 ) ; 

// Look-up 4 ADPCM coefficients 

ADPCMCoef fVQ. Lookup (nVQIndex, raADPCMcoeff [ch] [n] , 4) 


1 bit per active subband 
12 bits per occurrence 
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X96 Extension Subframes 



Size (Bits) 



for (ch=0; ch<nPCHS; ch++) { 
nQSelect = BHUFF96 [ch] ; 
// Undo differential encoding 
QABITS96->ppQ [nQSelect] - >ClearDeltaSum ( ) ; 

for (n=nSBS96; n<anSBE96 [ch] ; n++) 

QABITS96->ppQ [nQSelect] - >InverseQ ( InputFrame , 



Variable bits per each 
active subband 



ABITS96 [ch] [n] 



for (ch=0; ch<nPCHS; ch++) { 
// Reset SCALES 

for (n=0; n<NumSubband; n++) { 
SCALES96 [ch] [n] = ; 

} 

// Select RMS table 
pScaleTable = &RMS6Bit ; 

// Select quantizer 
nQSelect = SHUFF96 [ch] ; 

// Clear differential accumulation. 
QSCALES .ppQ [nQSelect] - >ClearDeltaSum ( ) ; 
for (n=nSBS96; n<anSBE96 [ch] ; n++) { 
// Scale factor index 

QSCALES .ppQ [nQSelect] - >InverseQ ( InputFrame , nScale) 
// RMS look up 

pScaleTable- >LookUp (nScale, SCALES96 [ch] [n] ) ; 



for <ch=0; ch<nPCHS; ch++) 
if { JOINX96 [ch] >0 ) 

JOIN_SHUFF96 [ch] = ExtractBits ( 3 ) ; 
for (ch=0; ch<nPCHS; ch++) { 
if { JOINX96 [ch] >0 ) { 
// Get source channel. 
nSourceCh = JOINX96 [ch] - 1 ; 
// Select quantizer. 
nQSelect = JOIN_SHUFF96 [ch] ; 

for (n=anSBE96 [ch] ; n<anSBE96 [nSourceCh] ; n++) { 
// Extract joint scale factors 

pQJOIN_SCALES- >ppQ [nQSelect] - >InverseQ ( InputFrame , nJscale) 
// Biased by midpoint 
nJscale += 64; 

// Look up scale factor of joint intensity coding 
JScaleTbl. Lookup (nJscale, JOIN_SCALES96 [ch] [n] ) ; 



Variable bits per each 
active subband 



3 bits perch. Assuming 
JOINX96[ch]>0 



Variable bits 



if ( CPF == 1 ) 

SICRC96 = ExtractBits (16) ; 



16 bits (if CPF=1) 



6.1.3.4 96 kHz Extension Side Information 

PMODE96 

PMODE96 Indicates if ADPCM prediction is used (active) for each encoded subband of each primary audio channel. It 
is transmitted even for VQ encoded subbands. 

PVQ 

PVQ indexes to the vector codebook, (i.e. the same code book as in the core), to get the ADPCM prediction 
coefficients. It is transmitted only for subbands whose ADPCM is active. 
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ABITS96 

ABITS96[ch][n] are first difference-encoded. If HIGHRESFLAG96K = 0, the ABITS96 are then Huffman encoded 
using the 17 level codebooks, (clause D.5.8), otherwise they use the 33 level codebooks (clause D.5.10). This encoding 
is the same for all subbands (including the subbands that are VQ encoded) of each channel and is conveyed by 
BHUFF96. The index obtained after Huffman decoding indicates the quantizer that was used to quantize the subband 
samples for the n th subband of channel ch. 

SCALES96 

The quantization indices of the scale factors are encoded by 129-level Huffman codebooks (clause D.5.12). They are 
also difference-encoded before Huffman coding. The scale factors are obtained by using the quantization indexes to 
look up the 6-bit square-root quantization table (see Table 5-24). Single scale factor is transmitted per each active 
extension subband. The scale factors are transmitted even for the subbands with ABIT=0. 

JOIN_SHUFF96 

If joint subband coding is enabled, (JOINX96[ch]>0),then JOIN_SHUFF96[ch] selects which code book was used to 
encode JOIN_SCALES96 which will be used when copying subband samples from the source channel to the current 
channel ch. The extension joint scale factors are encoded in the same way as the extension SCALES and the codebook 
is obtained by Table 6-6. 

JOIN_SCALES96 

The scale factors are used to scale the subband samples copied from the source channel (JOINX96[ch]-l) to the current 
channel. The joint subbands start from the anSBE96 of the current channel until the anSBE96 of the source channel. 
Prior to its quantization the joint subband scale factors are normalized by the source channel scale factors in the 
corresponding subbands. The quantization index of the joint scale factor is encoded using the Huffman codebook 
indexed by JOIN_SHUFF96[ch]. The scale factors are obtained by using the quantization indexes to look up the 6-bit 
square root quantization in Table 5-24. 

SICRC96 

If CPF = 1 then SICRC96 shall be extracted from the bitstream. The CRC value test shall not be applied. 

6.1 .3.5 96 kHz Extension Audio Data Arrays 



Table 6-10: Extension Audio Data Arrays 



X96 Extension Audio Data Arrays 


Size (Bits) 


HFREQ96 

for (ch=0; ch<nPCHS; ch++) { 
nNumSamplSub-subFr = 8 ; 
nSsflter = nSSC/2; 

if { (nNumSamplSub-subFr*nSSC-nSsf Iter*16) !=0) 
nSsf Iter++ ; 

for (n=nSBS96; n<anSBE96 [ch] ; n++) { 
rScale = real (SCALES96 [ch] [n] [0] ) ; 

switch { ABITS96 [ch] [n] ) { 
case : 
// No bits allocated 

// Generate uniformly distributed random samples in range 

// [-0.5 , 0.5] and scale them with the extracted scale factor rScale 

aPrmCh[ch] .aSubband[n] . 

GenRandomSamples (nNumSamplSub- subFr*nSSC , rScale) ; 
break ; 

case 1 : 

prSample=aPrmCh [ch] .aSubband[n] . raSample [NumADPCMCoef f ] ; 
for (m=0; m<nSsfIter; m++) { 
// Unpack 

nVQIndex = ExtractBits ( 10 ) ; 

nNumElementVQ = nSSC*nNumSamplSub- subFr - m*16; 
nNumElementVQ= 

(nNumElementVQ>16 ) ? 16 : nNumElementVQ; 
// Look up 

HFreqVQ . Lookup (nVQIndex, prSample, nNumElementVQ) ; 
// Scale up 


10 bits per 
applicable subband 
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X96 Extension Audio Data Arrays 



Size (Bits) 



for (Ssfiter=0; Ssf iter<nNumElementVQ; Ssfiter++) 
* (prSample++) *= rScale; 

} 

break ; 
default : 



AUDIO DATA 



// Sub- sub- frame Loop 

for (nSub-sub-f rame=0 ; nSub-sub-f rame<nSSC; nSub- sub- f rame++ ) 
// Channel Loop 
for (ch=0; ch<nPCHS; ch++) { 
// Subband Loop 

for (n=nSBS96; n<anSBE96 [ch] ; n++) { 
nSample = nSub-sub-f rame*nNumSamplSub-subFr; 
nABITS = ABITS96 [ch] [n]-l; 



switch (nABITS) { 
case -1: 

// No bits allocated 

nQType = ; 

break; 
case : 

// VQ in current subband 

nQType = ; 

break; 
default : 

// Quantizer select 

nSEL = SEL96 [ch] [nABITS-1] ; 

nNumQ = pCQGroupAUDIO [nABITS-1] .nNumQ-1; 



VARIABLE BITS 



} 



nQType = 1; 
if ( nSEL==nNumQ ) { 
if ( nABITS<=7 ) 

nQType = 3; 
else 

nQType = 2 ; 

} 

pCQGroup = kpCQGroupAUDIO [nABITS-1] ; 



// Number of quantizers 
// Determine quantizer type 
// Assume Huffman quantizers as default 



// Block quantizers 
// Linear quantizer 

// Select quantizer group 



// Extract bits 

switch { nQType ) { 
case : 

// Case of VQ or ABIT=0 
break ; 

case 1 : // Huffman quantizers 

for (m=0; m<nNumSamplSub-subFr; m++, nSample++) 

pCQGroup- >ppQ [nSEL] -> InverseQ ( InputFrame , AUDIO [ch] [n] [nSample] 

break; 

case 2 : // Linear quantizers 

for (m=0; m<nNumSamplSub-subFr; m++, nSample++) { 
pCQGroup- >ppQ [nSEL] - >InverseQ (InputFrame , nCode) ; 

AUDI0[ch] [n] [nSample] = pCQGroup- >ppQ [nSEL] - >SignExtension (nCode) ; 

} 

break ; 

case 3 : // Block quantizers 

int nResidue; 
CBlockQ *pCBQ; 

pCBQ = kpCBlockQ [nABITS-1] ; // Select block quantizer 
for (m=0; m<nNumSamplSub- subFr/4 ; m++) { 
// Get block code 

pCQGroup- >ppQ [nSEL] - >InverseQ (InputFrame , nCode) ; 
// Lookup 4 samples for a single block code 
nResidue=pCBQ->LookUp (nCode, &AUDIO [ch] [n] [nSample] ) ; 
nSample += 4; 

} 

break; 

default: // No bits allocated 

} // End of Switch 
} // End of subband Loop 

} // End of channel Loop 

// Check for DSYNC 

if ( (nSub-sub-frame== (nSSC-1) ) || (ASPF==1) ) { 
SYNC = InputFrame. ExtractBits (16) ; 
if ( DSYNC != Oxffff) { 
nErrorFlag =5; // 5 = sync error 

printf ( "Wrong DSYNC %x detected at end of sub-frame 



sub-sub-frame %d\n\n", 
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X96 Extension Audio Data Arrays 


Size (Bits) 


DSYNC, nSub- frame, nSub- sub- frame ) ; 
} 




} 

} // End of sub- sub- frame loop 




N = 16*nSSC; 

// Select step size table 
if ( RATE == Oxlf ) 

pStepSizeTable = &StepSizeLossLess ; 
else 

pStepSizeTable = &StepSizeLossy ; 




// 

// Scale factor and step size 

for <ch=0; ch<nPCHS; ch++) { // Channels 

for (n=nSBS96; n<anSBE96 [ch] ; n++) { // Subbands 
pSubband = kaPrmCh [ch] . aSubband [n] ; 
// Reset assembled sample index 
nAssembledSamplelndex = NumADPCMCoef f ; 
// Bit allocation 
nABITS = ABITS96 [ch] [n]-l; 




if { nABITS>0 ) { 
// Look up step size 

pStepSizeTable- >LookUp (nABITS , rStepSize) ; 
/ / Scale factor 

rStepRMS = rStepSize * (real) SCALES96 [ch] [n] [0] ; 
for (m=0; m<N; m++, nAssembledSampleIndex++ ) 

pSubband- >raSample [nAssembledSamplelndex] = rStepRMS*AUDIO [ch] [n] [m] ; 

} 




// Inverse AD PCM 

if ( PMODE [ch] [n] != )// Only when prediction mode is on. 
pSubband- >InverseADPCM (2*CSubband : mNumSample) ; 




} // End of subband loop 
} // End of channel Loop 
// Update AD PCM history 
for (ch=0; ch<nPCHS; ch++) 

aPrmCh[ch] . UpdateADPCMHistory ( ) ; 




// Joint intensity coding and clear unused subbands 
for (ch=0; ch<nPCHS; ch++) { // Channels 
if ( JOINX96 [ch] >0 ) { // Joint subbands 
// Copy joint subbands 
nSourceCh = JOINX96 [ch] - 1 ; 

for (n=anSBE9S [ch] ; n<anSBE96 [nSourceCh] ; n++) { 
rJScale = JOIN SCALES96 [ch] [n] ; 

pSSubband= & (aPrmCh [nSourceCh] . aSubband [n] ) ; // Source subband 
pSubband = & (aPrmCh [ch] . aSubband [n] ) ; // Joint subband 
for (m=0; m<NumADPCMCoef f +N; m++) 

pSubband- >raSample [m] = rJScale*pSSubband- >raSample [m] ; 

} 




// Clear unused subbands 

for (n=anSBE96 [nSourceCh] ; n<NumSubband; n++) { // Subbands 
pSubband = & (aPrmCh [ch] . aSubband [n] ) ; 
for (m=0; m<NumADPCMCoef f +N; m++) 

pSubband- >raSample [m] = (real)0; 

} 


10 bits per 
applicable subband 


} 

else { // No joint subbands 
// Clear unused subbands 

for (n=anSBE96 [ch] ; n<NumSubband; n++) { // Subbands 
pSubband = & (aPrmCh [ch] . aSubband [n] ) ; 
for (m=0; m<NumADPCMCoef f +N; m++) 

pSubband- >raSample [m] = (real)0; 

} 




} 

} // End channel loop 

// 

// Clear unused subbands 

for (ch=0; ch<nPCHS; ch++) { // Channels 
for (n=0; n<nSBS96; n++) { // Subbands 
pSubband = & (aPrmCh [ch] . aSubband [n] ) ; 
for (m=0; m<NumADPCMCoef f +N; m++) 
pSubband- >raSample [m] = (real)0; 
} 

} 
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HFREQ96 

Some high frequency extension subbands are encoded using vector quantization (VQ). The encoder searches for the 
32-element vector with elements to 15 that best matches the vector of 16 subband samples, corresponding to the 
16 samples (at 96 kHz/64) from the current subsubframe. This vector is indexed by HFREQ96. One HFREQ06 is 
transmitted per one VQ encoded extension subband in each subsubframe. The 10-bit index HFREQ96 points to one of 1 
024 vectors each consisting of 32 elements (the VQ table used here is the same as the one used in the core HFREQ VQ 
see Page 72). 

NOTE: in the subbands with no allocated bits (ABIT=0) the subband samples are generated from the random 
samples and this operation is included in the pseudo code in Table 6-10. 

AUDIO 

The audio data are grouped as nSSC subsubframes, each consisting of 16 samples for each subband. A mid-tread linear 
quantizer indexed by ABITS quantizes each sample. The resultant quantization index may further be encoded by either 
a Huffman or block code. If it is not, it is included in the bitstream as 2's compliment. All this information is indicated 
by SEL. The (ABITS, SEL) pair then tells how the subband samples should be extracted from the bitstream. 

The resultant subband samples are then compensated by their respective quantization step sizes and scale factors. For 
the subbands that are ADPCM encoded, the samples of these subbands thus far obtained are actually the difference 
signals. Their real values shall be recovered through a reverse ADPCM process. In the subbands with ABIT=0, no audio 
codes are transmitted. 

At the end of each subsubframe, there may be a synchronization check word DS YNC = Oxffff depending on the flag 
ASPF in the frame header, but there shall be at least a DSYNC at the end of each subframe. 

6.1 .3.6 Interpolation of the LFE Channel Samples 

On the DTS encoder side, the LFE channel samples are encoded in the core encoder. No effort is made to represent 
higher frequency components or higher resolution in the extension encoder. Encoder first decimates input LFE samples 
by a factor of 64x resulting in an effective channel bandwidth of 375 Hz (352 Hz in case of 44,1 kHz sampling). These 
samples are furthermore quantized and transmitted in a DTS core bit-stream. Core decoder performs inverse 
quantization and 64x interpolation that generates reconstructed LFE samples at 48 kHz (44,1 kHz). The reconstructed 
LFE samples have significant frequency components of up to 375 Hz (352 Hz) and all other frequency components are 
at least 100 dB lower in levels. The LFE samples at 96 kHz (88,2 kHz) are generated in an extension decoder using 
interpolation by factor of 2x. The goal of this filter is to remove an image introduced by interpolation. Significant 
frequency components of this image lie between 47,625 kHz (43,748 kHz) and 48 kHz (44,1 kHz). Thus, a simple 5-tap 
linear phase FIR filter with a magnitude response shown in Figure 6-6 is used as the 2x interpolation filter in LFE 
channel. 
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Figure 6-6: Magnitude Response of LFE Interpolation 2x Filter at 96 kHz 

The coefficients of 2x LFE interpolation filter scaled by an interpolation factor of 2 are given in 
Table 6-11. 

Table 6-11 : LFE 2x Interpolation Filter Coefficients 

1 ,2553677676342990e-001 
4,999991 380021 6800e-001 
7,489281 7046880420e-001 
4,999991 380021 6800e-001 
1 ,2553677676342990e-001 



The near-perfect reconstruction 64-band cosine modulated filter bank is obtained by modulating the 1 024-tap FIR 
linear phase prototype filter. The signs of these filter coefficients were changed in a manner appropriate for an efficient 
implementation of polyphase filtering. In particular, the signs of all coefficients in every even indexed block of 
128 coefficients are changed, e.g. coefficients in ranges 129 to 256, 385 to 512, 641 to 768 and 897 to 1 024 changed 
their signs. The modified prototype filter coefficients are given in clause D.9. 

6.2 XBR - Extended Bit Rate Extension 

The original DTS core encoder allowed 5.1 channels of high quality audio (sampling frequency F s up to 48 kHz) to be 
encoded and subsequent 'extensions' to the core were added to allow (a) additional channels to be encoded and (b) high 
frequency elements to be encoded (Fs up to 96 kHz). 

This clause describes a Bit-rate Extension (XBR) that allows for the total core + extensions bit-rates to be larger than 
1,5 Mbps. This in itself is nothing new, but the difference is a guaranteed backward compatibility with the DTS core 
decoders already in the market (which can only handle DTS bit streams up to rates of 1,5 Mbps). This is achieved by 
the inclusion of the core substream at 1,5 Mbps, along with the extension substream that carries the XBR extension. The 
XBR extension enhances the quality of audio that has been encoded in the core substream, by means of allocating 
additional bit-pool for the encoding of residual signals. The residual signals carry the information about the original 
audio that has not been represented by the core substream data. 
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6.2.1 DTS Core Substream Encoder + XBR Extension Encoder 
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Figure 6-7: DTS Core Substream + XBR Extension Encoder 



The XBR extension involves encoding the residuals that are the difference between the original subband samples and 
the subband samples reconstructed by decoding the core substream data. 

The XBR encoding only occurs after the core substream encoding has completed. Therefore the XBR encoding has NO 
effect on the core substream encoding. 

The XBR encoding process is as follows: 

• Generate residuals. These values are just the difference between the original subband samples and the 
subband samples reconstructed by decoding the core substream data. 

• Bit allocation. Bits are then allocated in the bit rate extension. The subband power of the residuals is 
calculated and based on the XBR bit-pool. It divides it to nNumSbFrm by nXBRChannels, bit-pools one per 
subframe, in each channel. Division is uniform between the subframes and proportional to the channel power. 
The residual samples overwrite the core subband samples in psub_band_samples. 

• Quantize scales. The scale factors are then quantized using Table 5-24. 

• Quantize samples. The (residual) subband samples are then quantized. 

• Pack XBR extension. The encoded XBR data is then packed into the DTS frame. 



6.2.2 DTS XBR Bit Rate Extension Decoder 

In the decoder, the unpack block first separates the core substream and the extension subs-stream. Next the core 
substream data is decoded and the corresponding subband samples are reconstructed. The XBR decoder then assembles 
the residual subband samples carried in the XBR extension and adds them to the corresponding subband samples that 
have been reconstructed from the core substream data, as illustrated in Figure 6-8. The resulting subband samples are 
fed to the synthesis filter bank, where the decoded PCM audio samples are synthesized. For legacy decoders, which are 
not capable of decoding the XBR extension, the XBR data is ignored and the subband samples that have been 
reconstructed from the core substream data are fed to the synthesis filter bank, where the core substream decoded PCM 
audio samples are synthesized. 
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Figure 6-8: DTS XBR decoder preliminary unpacking 
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Code operation: 

UnpackHeaderXBR - extract XBR header information including TMODE bit. Set up pointers for the audio samples 
that follow. 

UnpackXBRSub-frame - unpack ABITS information and generate scale factors. Extract audio samples and scale them. 

AssembleXBRSubbands - assemble together residual subband samples adding them to the corresponding subband 
samples reconstructed from the core substream data. 

6.2.3 Extension (XBR) Bitstream Components 

The frame of XBR data is divided into a frame header and up to four channel sets. Each channel set has its own channel 
set header, which are all grouped together at the end of the frame header as shown in Figure 6-9. The CRC word is 
included at the end of frame header to allow detection of errors in the frame header. 
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Figure 6-9: XBR Data Frame Structure 

The channel set data is further subdivided into the subframes. Each subframe consists of a sub frame header and the 
audio data. The audio data is organized in subsubframes as shown in Figure 6-10. The number of subframes is the same 
for all channel sets and is equal to the number of subframes in the core frame. Similarly the number of subsubframes is 
the same for all subframes of all channel sets and is equal to the number of subsubframes within each subframe of the 
core frame. In other words, the subframe and subsubframe partitioning within the XBR frame follows the partitioning 
present in the core frame. 
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Figure 6-10: Channel Set Data Structure 



6.2.4 XBR Frame Header 



Table 6-12: XBR Frame Header Structure 



XBR Frame Header Syntax 



Size (Bits) 



SYNCXBR = ExtractBits (32) ; 

nHeaderSizeXBR = ExtractBits ( 6 ) +1 ; 

nuNumChSetsInXBR = ExtractBits (2 ) +1 ; 

for (nChSet=0; nChSet < nuNumChSetsInXBR; nChSet ++) 

pnuChSetFsize [nChSet] = ExtractBits ( 14 ) +1 ; 
nXBRTMODEFlag = (ExtractBits { 1 ) == 1) ? true : false; 
for (nChSet=0; nChSet < nuNumChSetsInXBR; nChSet ++) 

Extract ChannelSetSubHeaderXBR{ } 
ReservedHeaderXBR = ExtractBits (...) ; 
ByteAlignHeaderXBR = ExtractBits ( ... 7) ; 
nCRC16HeaderXBR = ExtractBits ( 16 ) ; 



32 
6 
2 

14 
1 

See XBR Channel Set 
Sub-Header Table 

0...7 
16 
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SYNCXBR (XBR Sync Word) 

The DWORD aligned XBR synchronization word has value 0x655e315e. During sync detection the 
nCRC16HeaderXBR checksum is used to further verify that the detected sync pattern is not a random alias. 

nHeaderSizeXBR (XBR frame header length) 

The size of the header in bytes from the SYNCXBR to the nCRC16HeaderXBR inclusive. This value determines the 
location of the channel set data block in the first channel set. This marker also designates the end of the field 
nCRC16HeaderXBR and allows quick location of the checksum at byte position nHeaderSizeXBR-2. 

nuNumChSetsInXBR (Number of Channel Sets) 

All channels within the XBR extension are organized in individually decodable channel sets. The nNumChSetalnXBR 
is the number of channel sets that are present in XBR. 

pnuChSetFsize (Channel Set Data Byte Size) 

The pnuChSetFsize[nChSet] indicates the total number of bytes in the data portion of the channel set (nChSet) in the 
XBR frame. Starting from the SYNCXBR and using the cumulative sum of nHeaderSizeXBR and the 
pnuChSetFsize[k] (over all channel sets k=0, ... nChSet -1) as offset, decoder may traverse to the beginning of channel 
set data block in the channel set nChSet. 

XBRTMODEFlag (TMODE used flag) 

If the XBRTMODEFlag is true, the TMODES that are present in the core stream (and if present in the XCH stream or 
XXCH stream) shall be used for extraction of the XBR stream. 

If XBRTMODEFlag is false, the TMODES present in the core channel extension stream shall be ignored by the XBR 
decoder. 

ReservedHeaderXBR (Reserved) 

This field is reserved for additional XBR header information. The decoder shall assume that this field is present and of 
unspecified length. Therefore, in order to continue unpacking the stream, the decoder shall skip over this field using the 
XBR header start pointer and the XBR header size nHeaderSizeXBR. 

ByteAlignHeaderXBR (Pad to BYTE boundary) 

This field ensures that the CRC16 field that follows is aligned to a byte boundary to allow fast table based CRC16 
calculation. Append '0's until bit position is a multiple of 8. 

nCRC16HeaderXBR (CRC16 of XBR frame header) 

It represents the 16-bit CRC check word of the entire XBR header from positions nHeaderSizeXBR to 
ByteAlignHeaderXBR inclusive. 

6.2.5 XBR Channel Set Sub-Header 



Table 6-13: XBR Channel Set Sub-Header Structure 



XBR Channel Set Sub-Header Syntax 


Size (Bits) 


nXBRCh = ExtractBits (3) + 1 ; 
nBits4MaxSubbands = ExtractBits (2 ) + 5 ; 
for (nCh=0; nCh< nXBRCh ; nCh++) 

naXBRActiveBands = ExtractBits (nBits4MaxSubbands ) + 1; 


3 
2 

nBits4MaxSubbands 



nXBRCh (Number of XBR channels) 

nXBRCh indicates the number of channels that are encoded in this XBR channel set. 
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nBits4MaxSubbands (Max subbands flag) 

The nBits4MaxSubbands flag indicates the number of bits used to indicate the number of active subbands in the XBR 
channel set. From this, the number of bits used for indicating the number of active subbands in each channel 
(naXBRActiveBands[ch]) can be calculated. This is a 2-bit field with [0,1,2,3] corresponding to values [5,6,7,8]. 

naXBRActiveBands (XBR Active subbands) 

This field indicates the number of subbands that are encoded in a particular channel of the XBR channel set. Encoding 
always starts from subband and it goes up to the subband XBR_ActiveBands-l (XBR_ActiveBands < 32 for Fs < 48 k 
or XBR_ActiveBands < 64 for 48 kHz < Fs < 96 kHz). 



6.2.6 XBR Channel Set Data 



Table 6-14: XBR Channel Set Data Syntax 



XBR Channel Set Data Syntax 



Size (Bits) 



for (nCh=0; nCh<nXBRCh; nCh++) { 

nNumABITSbits [nCh] = ExtractBits (2 ) +2 ; 

} 

// Unpack ABITSH 
nChSBIndex = ; 

for (nCh=0; nCh<nXBRCh; nCh++) { 
nTmp = nNumABITSbits [nCh] ; 

for (nSB=0; nSB<naXBRActiveBands [nCh] ; nSB++, nChSBIndex++) { 
ancAbitsHigh [nChSBIndex] = ExtractBits (nTmp) ; 

} 

} // end of channel loop 
for (nCh=0; nCh<nXBRCh; nCh++) { 

ancTemp [nCh] = InputFrame . ExtractBits ( 3 ) ; 
if (ancTemp [nCh] <1) 

return SYNC_ERROR_DEF ; 

} 

// Generate scale factors 
nChSBIndex = ; 

for (nCh=0; nCh<nXBRCh; nCh++) { 
// Select RMS table 
if ( SHUFF [nCh] == 6 ) 

pScaleTable = &RMS7Bit ; 
else 

pScaleTable = &RMS6Bit ; 
// nTmp is the number of bits used for each scale index 
nTmp = ancTemp [nCh] ; 

for (nSB=0; nSB<naXBRActiveBands [nCh] ; nSB++, nChSBIndex++) { 
if (ancAbitsHigh [nChSBIndex] >0) { 
// Pre-transient scale 
/ / Unpack Scale index 
nScalelnd = ExtractBits (nTmp) ; 

// Look-up scale factor 

if ( pScaleTable ->LookUp (nScalelnd, anScalesHigh [nChSBIndex<<l] ) ==NULL ) 

{ 

return SYNC_ERROR_DEF ; 

} 

// Post-transient scale 

if ( nXBRTMODE Fl ag * TMODE [nCh] [nSB] >0) { 

// Unpack Scale index 

nScalelnd = ExtractBits (nTmp) ; 

// Look-up scale factor 
if ( pScaleTable- >LookUp (nScalelnd, 
anScalesHigh [ (nChSBIndex<<l) +1] ) ==NULL ) { 
return SYNC_ERROR_DEF ; 
} 

} // if (ancAbitsHigh [nChSBIndex] >0) 
else{ 

anScalesHigh [nChSBIndex<<l] = ; 
anScalesHigh [ (nChSBIndex<<l) +1] = 0; 

} 



2 bits per channel 



Variable bits per channel 



3 bits per channel 



Variable per subband 
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6.2.6.1 Subframe Side Information 

The pseudo code for the subframe side information is depicted in Table 6-14. 
Number of bits for ABITS nNumABITSbits 

Indicates the number of bits used for encoding ABITSH. This is a 2-bit field with [0,1,2,3] corresponding to 
values [2,3,4,5]. 

XBR Bit allocation Index (ABITSH) 

ABITSH indicates the bit allocation indexes for all channels in the channel set and all subbands in the Bit-rate 
extension. 

Number of bits for Scale indices (nNumScalesbits) 

nNumScalesbits indicates the number of bits used for encoding Scale indices (3-bit field). The encoder uses either 6-bit 
or 7-bit square-root quantization table (as indicated by SHUFF[nCh]) to quantize the scale factors. The quantization 
indexes (6-bit or 7-bit) values are further analyzed to determine the largest index value (Max_Scale_Ind) for each 
channel in the current subframe. The efficient transmission of scale indexes is achieved by using nNumScalesbits = 
ceil( log 2 (Max_Scale_Ind) ) bits per index. 

XBR Scales indices (nScalelnd) 

First the scale factor quantization indexes are extracted as the nNumScalesbits bit words. The scale factors are obtained 
by using the quantization indexes to look up the square-root quantization table (this is the same table as for the core 
scale factors). Two scale factors are transmitted per each active extension subband if TMODE and nXBRTMODEFlag 
are set. The scale factors are not transmitted for the subbands with ABITSH=0. 
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6.2.6.2 



XBR Extension Residual Audio Data Arrays 



Table 6-15: XBR Extension Residual Audio Data 



XBR Extension Residual Audio Data 



Size (Bits) 



AUDIO 



// Unpack residual audio codes 

for (nSub- sub- f rame=0 ; nSub- sub- f rame<nSSC ; nSub- sub- f rame++ ) { 
nChSBIndex = ; 

for (nCh=0; nCh<nXBRCh; nCh++) { // Channels 
// Subbands 

for (nSB=0; nSB<naXBRActiveBands [nCh] ; nSB++, nChSBIndex+ + ) { 
nXBR_ABITS = ancAbitsHigh [nChSBIndex] ; 
nSmpl = nSub-sub-f rame<<3 ; 

if (nXBR_ABITS>0) { // Audio codes are present only for nlndex>0 
if (nXBR_ABITS>7) { 
// Linear quantizers 

nTmp = nXBR_ABITS-3 ; // Number of bits used for packing 

nShift = 32 - nTmp; 

for (nn=0; nn<8; nn++, nSmpl++) { 

// Extract audio residuals and do the sign extension 
nCode = InputFrame . ExtractBits (nTmp) <<nShift ; 
AUDIO [nCh] [nSB] [nSmpl] = nCode>>nShif t ; 
} 

} // end of nXBR_ABITS>7 
else { 

// Block encoded quantizers 

pCBQ = kpCBlockQ [nXBR_ABITS-l] ; // Select block quantizer 

// Get the length of one block code 

nTmp = nLenBlockCode [nXBR_ABITS-l] ; 

for (nn=0; nn<2; nn++) { 

nCode = InputFrame . ExtractBits (nTmp) ; 

nResidue = pCBQ- >LookUp (nCode , &AUDI0 [nCh] [nSB] [nSmpl]); 
// Lookup 4 samples for a single block code 
if ( nResidue != ) { 

printfC'XBR AUDIO: Block code error\n" ) ; 

return SYNC_ERROR_DEF ; 

} 

nSmpl += 4; 
} 

} // end of nXBR_ABITS<=7 
} // end of nXBR_ABITS>0 
else{ // nXBR_ABITS=0 

for (nn=0; nn<8; nn++, nSmpl++) 
AUDIO [nCh] [nSB] [nSmpl] = 0; 
} // end of nlndex condition 
} // End nSB loop 
} // End nCh loop 

// Check for DSYNC which is used to verify end of sub-frame or sub-sub-frame 
position 

if ( (nSub-sub-f rame== (nSSC-1) ) || (ASPF==1) ) { 
DSYNC = InputFrame. ExtractBits (16) ; 

if ( DSYNC != Oxffff) { 

return SYNC_ERR0R_DEF ; 

} 
} 

} // end nSub- sub- frame loop 



Variable per 
subband 



AUDIO (Audio Data) 

The residual audio samples are grouped as nSSC subsubframes, each consisting of eight samples for each subband. All 
samples in each subband of each channel may have been linear quantized, or block encoded - this is indicated by 
nXBR_ABITS. If nXBR_ABITS = then no audio residuals are present in the DTS stream for this subsubframe of this 
channel and subband. The corresponding residual array entries should be filled with zeros. 

At the end of each subsubframe there may be a synchronization check word DSYNC = Oxffff depending on the flag 
ASPF in the frame header, but there shall be at least a DSYNC at the end of each subframe. 
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6.2.7 Assembly of XBR subbands 

As per Figure 6-8, once the XBR information has been extracted, it shall be recombined with the 1,5 Mbps backward 
compatible 'core + extensions' stream prior to decoding of these streams. Table 6-16 shows the pseudo code 
implementation. 

Table 6-16: XBR Assembling Subbands 



XBR Assembling Subbands 

//Assembling subbands following XBR extraction and prior to core + extensions decoding 

// This function assembles together residual subband samples 

// and adds them to the corresponding core+XCh+X96 subband samples 

// stored in CSB . naSample [] arrays 

AssembleXBRSubbands (unsigned int nBands2Use) 

{ 

DTS int64 dAcc; 

// Select step size table 
pStepSizeTable = &StepSizeLossLess ; 

// Assemble subbands 
nAccChSBInd = ; 

for <nCh=0; nCh<nXBRCh; nCh++) { // Channels 

nBands2Use = (nBands2Use>naXBRActiveBands [nCh] ) ? naXBRActiveBands [nCh] : nBands2Use; 
nChSBIndex = nAccChSBInd; 

for (nSB=0; nSB<nBands2Use ; nSB++, nChSBIndex++ ) { // Subbands 

switch (nX96Present) { 
case : 

pSubbandSmpls = aPrmCh [nCh] . aSubband [nSB] . naSampleCore ; 
break ; 

default : 

pSubbandSmpls = aPrmCh [nCh] . aSubband [nSB] . naSampleExt ; 
break; 
} 

// Look up step size 

nABITS = ancAbitsHigh [nChSBIndex] ; 

if { pStepSizeTable- >LookUp (nABITS, nStepSize) ==NULL ) 
printf ( "ERROR: StepSize lookup failure ABITS=%d" , nABITS); 

// Set transient ( sub- sub- frame ) location 
nTmode = nXBRTMODEFlag*TMODE [nCh] [nSB] ; 

nSmplLim = { nTmode == ) ? CSubband: mNumSample : 8*nTmode; 

// Reconstruct residuals and add them to the existing (in naSample array) 
// subband samples; Store results back to the naSample array 

nRawSampleIndex=0 ; // Reset raw sample index 

nAssembledSamplelndex = NumADPCMCoef f ; // Reset assembled sample index 
// PreTransient 

dAcc = (DTS int64) nStepSize * anScalesHigh [nChSBIndex<<l] ; 

// Find location of the most significant "1" 
m=0 ; 

while (dAcoO ) { 

m++ ; 

dAcc=dAcc>>l ; 
} 

nShift = (m>SUBBSAMPLS_QRES) ? (m-SUBBSAMPLS_QRES) : 0; 

nStepRMS = (int) ( ( ((DTS intS4) nStepSize * anScalesHigh [nChSBIndex<<l] )) >>nShift ) ; 

nShift = 22-nShift; 

nRoundF = (nShift>0) ? 1<< (nShif t-1) : 0; 

for (m=0; m<nSmplLim; m++, nRawSampleIndex++ , nAssembledSampleIndex++) { 

dAcc = nRoundF + (DTS int64) nStepRMS*AUDIO [nCh] [nSB] [nRawSamplelndex] ; 

//limits check 

nTmp = (int) (dAcc>>nShif t) ; 
if (nTmp>nHigh) 

nTmp = nHigh; 
else if (nTmp<nLow) 

nTmp = nLow ; 

//write assembled value 

pSubbandSmpls [nAssembledSamplelndex] += nTmp; 

} 
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XBR Assembling Subbands 

// Af terTransient complete the same process as for pretransient 
} // SB loop 

nAccChSBInd += naXBRActiveBands [nCh] ; 
} / / channel loop 
return 1 ; 



6.3 Extension to 6.1 Channels (XCh) 

The XCh extension expands the core capability to encoding of 6.1 discrete channels in the configuration that assumes 
standard 5.1 layout plus the additional surround channel positioned directly behind the listener (180°) and denoted as a 
centre surround (Cs) channel. Additional requirement for the valid XCh stream is that 6.1 to 5.1 down-mix is embedded 
in the stream, i.e. the Cs channel is mixed into the left surround (Ls) and right surround (Rs) channels with the 
attenuation of -3 dB exclusively. This 5.1 down-mix is encoded in the core stream and the Cs channel is encoded in the 
XCh stream. This way in the 5.1 listening environment the Cs channel will be reproduced as the phantom image 
between the Ls and Rs speakers. The Cs channel in the XCh stream is compressed using exactly the same technology as 
the core audio channels. The audio data representing this extension channel (XCh stream) is appended to the end of the 
core audio data (core stream). This extension audio data is automatically ignored by first generation DTS decoders but 
can be decoded by second generation DTS decoders. 

6.3.1 Unpack Frame Header 



Table 6-17: XCH Frame header 



XCH Frame Header 


Size (Bits) 


XChSYNC = ExtractBits (32) ; 
XChFSIZE = ExtractBits (10) ; 
AMODE = ExtractBits (4) ; 


32 
10 
4 



XChSYNC (Channel Extension Sync Word) 

The synchronization word XChSYNC = 0x5a5a5a5a for the channel extension audio comes after all other extension 
streams (i.e. in case of multiple extension streams the XCh stream is always the last). For 16-bit streams, XChSYNC is 
aligned to a 32-bit word boundary. For 14-bit streams, it is aligned to both 32-bit and 28-bit word boundaries, meaning 
that the sync word appears as 0xl696e5a5 in the 28-bit stream and as 0x5a5a5a5a after this stream is packed into a 
32-bit stream. 

Since the pseudo sync word might appear in the bitstream, it is MANDATORY to check the distance between this sync 
and the end of the encoded bitstream. This distance in bytes should be equal to XChFSIZE+1. The parameter 
XChFSIZE is described below. 

NOTE: For compatibility reasons with legacy bitstreams the estimated distance in bytes is checked against the 

XChFSIZE+1 as well as the XChFSIZE. The XCh synchronization is pronounced if the distance matches 
either of these two values. 

XChFSIZE (Primary Frame Byte Size) 

(XChFSIZE+1) is the distance in bytes from current extension sync word to the end of the current audio frame. Valid 
range for XChFSIZE: 95 to 1 023. Invalid range: to 94. 

AMODE (Extension Channel Arrangement) 

Audio channel arrangement which describes the number of audio channels (CHS) and the audio playback arrangement. 
It is set to represent the number of extension channels. 
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6.3.2 Unpack Audio Header 



Table 6-18: XCH Audio header 



XCH Audio Header 



Size (Bits) 



PCHS = ExtractBits (3) ; 
nPCHS = PCHS + 1; 
for <ch=0; ch<nPCHS; ch++) { 

SUBS [ch] = ExtractBits (5) ; 

nSUBS[ch] = SUBS [ch] + 2 ; 

} 

for (ch=0; ch<nPCHS; ch++) { 
VQSUB[ch] = ExtractBits (5) ; 
nVQSUB[ch] = VQSUB [ch] + 1; 

} 

for (ch=0; ch<nPCHS; ch++) 
JOINX[ch] = ExtractBits (3) ; 

for (ch=0; ch<nPCHS; ch++) 
THUFF [ch] = ExtractBits (2) ; 

for (ch=0; ch<nPCHS; ch++) 

SHUFF[ch] = ExtractBits (3) ; 
for (ch=0; ch<nPCHS; ch++) 
BHUFF [ch] = ExtractBits (3) ; 
// ABITS=1: 
n=0; 

for (ch=0; ch<nPCHS; ch++) 

SEL[ch] [n] = ExtractBits (1) ; 
// ABITS = 2 to 5: 
for (n=l; n<5; n++) 

for (ch=0; ch<nPCHS; ch++) 

SEL[ch] [n] = ExtractBits (2) ; 
// ABITS = 6 to 10: 
for (n=5; n<10; n++) 

for (ch=0; ch<nPCHS; ch++) 

SEL[ch] [n] = ExtractBits (3) ; 
// ABITS = 11 to 26: 
for (n=10; n<26; n++) 

for (ch=0; ch<nPCHS; ch++) 

SEL[ch] [n] =0; // Not transmitted, set to zero. 
// ABITS = 1 : 
n = 0; 

for (ch=0; ch<nPCHS; ch++ 
if ( SEL [ch] [n] == ) 
// Extract AD J index 
ADJ = ExtractBits (2 ) ; 
/ / Look up ADJ table 
arADJ[ch] [n] = Ad j Table [ADJ] ; 

} 

// ABITS = 2 to 5 : 
for (n=l; n<5; n++) 

for (ch=0; ch<nPCHS; ch++ 
if { SEL [ch] [n] < 3 ) { 
// Extract ADJ index 
ADJ = ExtractBits (2 ) ; 
// Look up ADJ table 
arADJ[ch] [n] = Ad j Table [ADJ] ; 

} 

// ABITS = 6 to 10: 
for (n=5; n<10; n++) 

for (ch=0; ch<nPCHS; ch++) 

if ( SEL [ch] [n] < 7 ) { // Transmitted only when SEL<7 
/ / Extract ADJ index 
ADJ = ExtractBits (2) ; 
// Look up ADJ table 
arADJ[ch] [n] = Ad j Table [ADJ] ; 

} 

if ( CPF==1 ) // Present only if CPF=1. 
AHCRC = ExtractBits (16) ; 



// Transmitted only if SEL=0 (Huffman code used) 



// Transmitted only when SEL<3 



5 bits per 
channel 



5 bits per 
channel 

3 bits per 
channel 

2 bits per 
channel 

3 bits per 
channel 
3 bits per 
channel 



variable bits 



2 bits per 
occasion 



16 
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PCHS (Number of Extension Channels) 

This field indicates that there are nPCHS = PCHS+1 < 5 extension audio channels in the current frame. If AMODE flag 
indicates more than 5 channels apart from LFE, the additional channels are the extended channels and are packed 
separately in the extended data arrays. 

SUBS (Subband Activity Count) 

This field indicates that there are nSUBS[ch] = SUBS[ch]+2 active subbands in the audio channel ch. Samples in 
subbands above nSUBS[ch] are zero, provided that intensity coding in that subband is disabled. 

VQSUB (High Frequency VQ Start Subband) 

This field indicates that high frequency samples starting from subband nVQSUB[ch]=VQSUB[ch]+l are VQ encoded. 
High frequency VQ is used only for high frequency subbands, but it may go down to low frequency subbands for such 
audio episodes as silence. In case of insufficient MIPS, the VQs for the highest frequency subbands may be ignored 
without causing audible distortion. 

JOINX (Joint Intensity Coding Index) 

This field, JOINX[ch], indicates if joint intensity coding is enabled for channel ch and which audio channel is the 
source channel from which channel ch will copy subband samples. It is assumed that the source channel index is 
smaller than that of the intensity channel, (see Table 5-23). 

THUFF (Transient Mode Code Book) 

This field indicates which Huffman codebook was used to encode the transient mode data TMODE (See Table 5-23). 
SHUFF (Scale Factor Code Book) 

The scale factors of a channel are quantized nonlinearly using either a 6-bit (64-level, 2,2 dB per step) or a 7-bit 
(128-level, 1,1 dB per step) square root table, depending on the application. The quantization indexes may be further 
compressed by one of the five Huffman codes and this information is transmitted to the decoder by SHUFF[ch] 
(Table 5-24). 

BHUFF (Bit Allocation Quantizer Select) 

This field indicates the codebook that was used to encode the bit allocation index ABITS (to be transmitted later). See 
(Table 5-25). 

SEL (Quantization Index Codebook Select) 

After subband samples are quantized using a mid-tread linear quantizer, the quantization indexes may be further 
encoded using either entropy (Huffman) or block coding in order to reduce bit rate. Therefore, the subband samples 
may appear in the bit stream as plain quantization indexes (no further encoding), entropy (Huffman) codes, or block 
codes. For channel ch, the selection of a particular codebook for a mid-tread linear quantizer indexed by ABITS [ch] is 
transmitted to the decoder as SELfch] [ABITS [ch]]. No SEL is transmitted for ABITS[ch]>l 1, because no further 
encoding is used for those quantizers. The decoder can find out the particular codebook that was used using ABITS [ch] 
and SEL[ch] [ABITS [ch]] to look up (see Table 5-26). 

ADJ (Scale Factor Adjustment Index) 

A scale factor adjustment index is transmitted whenever a SEL value indicates a Huffman codebook. This index points 
to the adjustment values shown in Table 5-27. This adjustment value should be multiplied by the scale factor (SCALE). 

AHCRC (Audio Header CRC Check Word) 

If CPF = 1 then AHCRC shall be extracted from the bitstream. The CRC value test shall not be applied. 
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6.3.3 Unpack Subframes 



6.3.3.1 



Side Information 



Table 6-19: XCH side information 



XCH audio side information 



Size (Bits) 



for (ch=0; ch<nPCHS; ch++) 
for (n=0; n<nSUBS [ch] ; n++) 

PMODE[ch][n] = ExtractBits ( 1 ) ; 
int nVQIndex; 

for (ch=0; ch<nPCHS; ch++) 
for (n=0; n<nSUBS [ch] ; n++) 

if ( PMODE [ch] [n] >0 ) { // Transmitted only when AD PCM active 
// Extract the VQindex 
nVQIndex = ExtractBits ( 12 ) ; 

// Look up the VQ table for prediction coefficients. 
ADPCMCoeffVQ. Lookup (nVQIndex, PVQ [ch] [n] ) // 4 coefficients 

} 

for (ch=0; ch<nPCHS; ch++) { 

// BHUFF tells which codebook was used 
nQSelect = BHUFF [ch]; 

// Use this codebook to decode the bit stream for ABITS [ch] [n] 
for (n=0; n<nVQSUB [ch] ; n++) // Not for VQ encoded subbands . 
QABITS. ppQ [nQSelect] - >InverseQ ( InputFrame , ABITS [ch] [n] ) 

} 

// Always assume no transition unless told 
for (ch=0; ch<nPCHS; ch++) 

for (n=0; n<NumSubband; n++) 

TMODE [ch] [n] = 0; 
// Decode TMODE [ch] [n] 

if ( nSSC>l ) {// Transient possible only if more than one sub- sub- frame . 

for (ch=0; ch<nPCHS; ch++) { 
// TMODE [ch] [n] is encoded by a codebook indexed by THUFF [ch] 
nQSelect = THUFF [ch] ; 

for (n=0; n<nVQSUB [ch] ; n++) // No VQ encoded subbands 
if ( ABITS [ch] [n] >0 ) // Present only if bits allocated 
// Use codebook nQSelect to decode TMODE from the bitstream 
QTMODE .ppQ [nQSelect] >InverseQ ( InputFrame , TMODE [ch] [n] ) 



for (ch=0; ch<nPCHS; ch++) { 
// Clear SCALES 

for (n=0; n<NumSubband; n++) { 
SCALES [ch] [n] [0] = ; 
SCALES [ch] [n] [1] = ; 

} 

// SHUFF indicates which codebook was used to encode SCALES 
nQSelect = SHUFF [ch] ; 

// Select the root square table (SCALES were nonlinearly 

// quantized) . 

if ( nQSelect == 6 ) 

pScaleTable = &RMS7Bit ; // 7-bit root square table 
else 

pScaleTable = &RMS6Bit ; // 6-bit root square table 

// 

// Clear accumulation (if Huffman code was used, the difference 
// of SCALES was encoded) . 

// 

nScaleSum = ; 

// 

// Extract SCALES for Subbands up to VQSUB [ch] 

// 

for (n=0; n<nVQSUB [ch] ; n++) 

if ( ABITS [ch] [n] >0 ) { // Not present if no bit allocated 

// 

// First scale factor 

// 

// Use the (Huffman) code indicated by nQSelect to decode 
// the quantization index of SCALES from the bit stream 
QSCALES. ppQ [nQSelect] ->InverseQ( InputFrame, nScale) ; 
// Take care of difference encoding 

if ( nQSelect < 5 ) // Huffman encoded, nScale is the difference 
nScaleSum += nScale; // of the quantization indexes of SCALES. 



1 bit per 
subband 



12 bits per 
occurrence 



variable bits 



variable bits 



variable bits 
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XCH audio side information 



Size (Bits) 



else // Otherwise, nScale is the quantization 
nScaleSum = nScale; // level of SCALES. 

// Look up SCALES from the root square table 
pScaleTable->LookUp (nScaleSum, SCALES [ch] [n] [0] 



// 
// 
// 
if 



Two scale factors transmitted if there is a transient 

(TMODE [ch] [n] >0) { 
// Use the (Huffman) code indicated by nQSelect to decode 
// the quantization index of SCALES from the bit stream 
QSCALES .ppQ [nQSelect] - >InverseQ ( InputFrame , nScale) ; 
// Take care of difference encoding 

if ( nQSelect < 5 ) // Huffman encoded, nScale is the 

nScaleSum += nScale; // of SCALES, 
else // Otherwise, nScale is SCALES 

nScaleSum = nScale; // itself. 
// Look up SCALES from the root square table 
pScaleTable->LookUp (nScaleSum, SCALES [ch] [n] [1]) 

} 

} 

// 

// High frequency VQ subbands 

// 

for (n=nVQSUB [ch] ; n<nSUBS [ch] ; n++) { 

// Use the code book indicated by nQSelect to decode 
// the quantization index of SCALES from the bit stream 
QSCALES .ppQ [nQSelect] - >InverseQ (InputFrame, nScale) ; 
// Take care of difference encoding 

if ( nQSelect < 5 ) // Huffman encoded, nScale is the 

nScaleSum += nScale; // of SCALES, 
else // Otherwise, nScale is SCALES 

nScaleSum = nScale; // itself. 
// Look up SCALES from the root square table 
pScaleTable->LookUp (nScaleSum, SCALES [ch] [n] [0] ) ; 



for (ch=0; ch<nPCHS; 
if ( JOINX [ch] >0 ) 
JOIN_SHUFF [ch] = 
int nSourceCh; 
for (ch=0; ch<nPCHS; 



ch+ + ) 

// Transmitted only if joint subband coding enabled. 
ExtractBits (3) ; 



3 bits per 
channel 



ch+ + ) 

if ( JOINX [ch] >0 ) { // Only if joint subband coding enabled. 

nSourceCh = JOINX[ch]-l; // Get source channel. JOINX counts 

// channels as 1,2,3,4,5, so minus 1. 

nQSelect = JOIN_SHUFF [ch] ; // Select code book. 

for (n=nSUBS [ch] ; n<nSUBS [nSourceCh] ; n+ + ) { 

// Use the code book indicated by nQSelect to decode 

// the quantization index of JOIN_SCALES 

QSCALES. ppQ [nQSelect] ->InverseQ( InputFrame, nJScale) ; 

// Bias by 64 

nJScale = nJScale + 64; 

// Look up JOIN_SCALES from the joint scale table 
JScaleTbl .Lookup (nJScale, JOIN_SCALES [ch] [n] ) ; 

} 

} 

if ( CPF==1 ) // Present only if CPF=1. 
SICRC = ExtractBits (16) ; 



variable bits 



16 



PMODE (Prediction Mode) 

When PMODE[ch][n]=l, it indicates that ADPCM prediction is used (active) for subband n of extension audio channel 
[ch] and PMODE [ch][n]=0 otherwise. ADPCM shall be extracted from the bit stream for all subbands, but ADPCM 
reconstruction can be limited to the lowest 20 subbands if DSP does not have enough MIPS. 

PVQ (Prediction Coefficients VQ Address) 

This field indexes to the vector code book (clause D.10.1) to get the ADPCM prediction coefficients. It is transmitted 
only for subbands whose ADPCM is active. 
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ABITS (Bit Allocation Index) 

This field, ABITS [ch][n], is the index to the mid -tread linear quantizer that was used to quantize the subband samples 
for the n th subband of channel ch. ABITS [ch][n] may be transmitted as either a 4-bit or 5-bit word. When ABITS is 
encoded in a 4-bit word, it may be further encoded using one of the five Huffman codes. This encoding is the same for 
all subbands of each channel and is conveyed by BHUFF as shown in Table 5-25. There is no need to allocate bits for 
the high frequency subbands because they are encoded using VQ. 

TMODE (Transition Mode) 

This field, TMODE[ch][n], indicates if there is a transient inside a subframe (subband analysis window) for subband n 
of channel ch. If there is a transient (TMODE[ch][n]>0), it further indicates that the transition occurred in subsubframe 
(subband analysis subwindow) TMODE[ch][n] + 1. TMODE [ch][n] is encoded by one of the four Huffman codes and 
the selection of which is conveyed by THUFF (see Table 5-24). The decoder assumes that there is no transition 
(TMODE[ch][n]=0) for all subbands of all channels unless it is told otherwise by the bit stream. Transient does not 
occur in the following situations, so TMODE is not transmitted: 

• Only one subsubframe within the current subframe. This is because the time resolution of transient analysis is 
a subsubframe (subband analysis subwindow). 

• VQ encoded high frequency subbands. If there is a transient for a subband, it would not have been VQ 
encoded. 

• Subbands without bit allocation. If there is no need to allocate bits for a subband, there is no need to care about 
transient for it. 

SCALES (Scale Factors) 

One scale factor is transmitted for subbands without transient. Otherwise, two are transmitted, one for the episode 
before the transient and the other for after the transient. The quantization indexes of the scale factors may be encoded by 
Huffman code as shown in Table 5-24. If this is the case, they are difference-encoded before Huffman coding. The scale 
factors are finally obtained by using the quantization indexes to look up either the 6-bit or 7-bit square root quantization 
table according to Table 5-24. 

JOIN_SHUFF (Joint Subband Scale Factor Codebook) 

If joint subband coding is enabled (JOINX[ch]>0), JOIN_SHUFF[ch] selects which code book was used to encode the 
scale factors (JOIN_SCALES) which will be used when copying subband samples from the source channel to the 
current channel ch. These scale factors are encoded in exactly the same way as that for SCALES. Use Table 5-24 to 
look up the codebook. 

JOIN_SCALES (Scale Factors for Joint Subband Coding) 

The scale factors are used to scale the subband samples copied from the source channel (JOINX[ch]-l) to the current 
channel. The index of the scale factor is encoded using the code book indexed by JOIN_SHUFF[ch]. After this index is 
decoded, it is used to look up the table in clause D.3 to get the scale factor. No transient is permitted for jointly encoded 
subbands, so a single scale factor is included. The joint subbands start from the nSUBS of the current channel until the 
nSUBS of the source channel. 

SICRC (Side Information CRC Check Word) 

If CPF = 1 then SICRC shall be extracted from the bitstream. The CRC value test shall not be applied. 
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6.3.3.2 



Data Arrays 



Table 6-20: XCH audio data arrays 



XCH Data Arrays 



Size (Bits) 



for (ch=0; ch<nPCHS; ch++) 

for <n=nVQSUB [ch] ; n<nSUBS [ch] ; n+ + ) { 

// Extract the VQ address from the bit stream 
nVQIndex = ExtractBits ( 10 ) ; 

// Look up the VQ code book for 32 subband samples. 
HFreqVQ. Lookup (nVQIndex, HFREQ [ch] [n] ) ; 
// Scale and take the samples 

rScale = (real) SCALES [ch] [n] [0]; // Get the scale factor 
for (m=0; m<nSSC*8; m++, nSample++) 
aPrmCh[ch] .aSubband[n] .raSample[m] = rScale*HFREQ [ch] [n] [m] 



10 bits per 
applicable sub 
band 



Audio Data 



// Select quantization step size table 
if ( RATE == Oxlf ) 

pStepSizeTable = SStepSizeLossLess ; // Lossless quantization 
else 

pStepSizeTable = SStepSizeLossy ; // Lossy 
// Unpack the subband samples 

for (nSub-sub-f rame=0 ; nSub- sub- f rame<nSSC ; nSub- sub- f rame++ ) { 
for (ch=0; ch<nPCHS; ch++) 

for (n=0; n<nVQSUB [ch] ; n++) { // Not high frequency VQ subbands 

// 

// Select the mid-tread linear quantizer 

// 

nABITS = ABITS [ch] [n] ; // Select the mid-tread quantizer 
pCQGroup = SpCQGroupAUDIO [nABITS-1] ; // Select the group of 
// code books corresponding to the 
// the mid-tread linear quantizer. 

nNumQ = pCQGroupAUDIO [nABITS- 1] . nNumQ- 1 ; // Number of code 
// books in this group 

// 

// Determine quantization index code book and its type 

// 

// Select quantization index code book 
nSEL = SEL[ch] [nABITS-1] ; 
// Determine its type 

nQType =1; // Assume Huffman type by default 
if ( nSEL==nNumQ ) { // Not Huffman type 
if ( nABITS<=7 ) 

nQType =3; // Block code 
else 

nQType =2; // No further encoding 
} 

if ( nABITS==0 ) // No bits allocated 
nQType = ; 

// 

// Extract bits from the bit stream 

// 

switch ( nQType ) { 
case : //No bits allocated 
for (m=0; m<8; m++) 

AUDIO [m] = ; 
break ; 

case 1 : // Huffman code 
for (m=0; m<8; m++) 

pCQGroup->ppQ [nSEL] - >InverseQ ( InputFrame , AUDIO [m] ) ; 
break ; 

case 2 : // No further encoding 
for (m=0; m<8; m++) { 

// Extract quantization index from the bit stream 
pCQGroup- >ppQ [nSEL] - >InverseQ ( InputFrame , nCode) 
// Take care of 2's compliment 

AUDIO[m] = pCQGroup- >ppQ [nSEL] - >SignExtension (nCode ) ; 

} 

break; 

case 3 : // Block code 
pCBQ = &pCBlockQ [nABITS-1] ; // Select block code book 
m = ; 

for (nBlock=0; nBlock<2 ; nBlock++) { 

// Extract the block code index from the bit stream 
pCQGroup- >ppQ [nSEL] - >InverseQ ( InputFrame , nCode) 
// Look up 4 samples from the block code book 



variable bits 
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XCH Data Arrays 


Size (Bits) 


pCBQ->LookUp (nCode, &AUDI0 [m] ) 
m + = 4 ; 

} 

break; 

default: // Undefined 
print ("ERROR: Unknown AUDIO quantization index code book."); 

} 

} 

// Account for quantization step size and scale factor 
// Look up quantization step size 
nABITS = ABITS [ch] [n] ; 

pStepSizeTable- >LookUp (nABITS , rStepSize) ; 

// Identify transient location 

nTmode = TMODE [ch] [n] ; 

if { nTmode == ) // No transient 

nTmode = nSSC; 
// Determine proper scale factor 

if (nSub- sub- f rame<nTmode ) // Pre-transient 

rScale = rStepSize * SCALES [ch] [n] [0]; // Use first scale factor 

else // After-transient 

rScale = rStepSize * SCALES [ch] [n] [1]; // Use second scale factor 

// Adjustmemt of scale factor 

rScale *= arADJ[ch] [SEL [ch] [nABITS-1]]; // arADJ [ ] [ ] are assumed 1 

// unless changed by bit 

// stream when SEL indicates 

// Huffman code. 

// Scale the samples 

nSample = 8*nSub-sub-f rame; // Set sample index 
for (m=0; m<8; m++, nSample++) 

aPrmCh[ch] .aSubband[n] . aSample [nSample] = rScale*AUDIO [m] ; 
// Inverse AD PCM 

if { PMODE [ch] [n] != ) // Only when prediction mode is on. 

aPrmCh[ch] .aSubband[n] . InverseADPCM ( ) ; 
// Check for DSYNC 

if ( (nSub-sub-frame== (nSSC-1) ) || (ASPF==1) ) { 
DSYNC = ExtractBits (16) ; 
if { DSYNC != Oxffff ) 

printf ( "DSYNC error at end of sub- sub- frame #%d" , nSub- sub- frame ) ; 

} 

} 

} 





HFREQ (VQ Encoded High Frequency Subbands) 

At low bit rates, some high frequency subbands are encoded using vector quantization (VQ). The code book is given in 
clause D.10.2. Each vector from this code book consists of 32 subband samples, corresponding to the maximum 
possible subframe (4 normal subsubframes): 

4 subsubframe x 8 samples/subsubframe = 32 samples: 

If the current subframe is short of 32 samples, the remaining samples are padded with zeros and then vector-quantized. 
The vector address is then included in the bit stream. After the decoder picks up the vector address, it looks up the 
vector code book to get the 32 samples. But the decoder will only pick nSSCx8 out of the 32 samples and scale them 
with the scale factor SCALES. 

AUDIO (Audio Data) 

The audio data are grouped as nSSC subsubframes, each consisting of 8 samples for each subband. Each sample was 
quantized by a mid-tread linear quantizer indexed by ABITS. The resultant quantization index may further be encoded 
by either a Huffman or block code. If it is not, it is included in the bit stream as 2's compliment. All this information is 
indicated by SEL. The (ABITS, SEL) pair then tells how the subband samples should be extracted from the bit stream 
(Table 5-26). 

The resultant subband samples are then compensated by their respective quantization step sizes and scale factors. 
Special care is to be paid to possible transient in the subframe. If a transient is flagged by TMODE, one scale factor will 
be used for samples before the transient and the other one for the after the transient. 
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For some of the subbands that are ADPCM encoded, the samples of these subbands thus far obtained are actually the 
difference signals. Their real values are recovered through a reverse ADPCM process: 

• At end of each subsubframe there may be a synchronization check word DSYNC = Oxffff depending on the 
flag ASPF in the frame header, but there shall be at least a DSYNC at the end of each subframe. 



6.4 Extension to More Than 5.1 Channels (XXCH) 

The XXCH extension supports lossy encoding of more than 5.1 channels by combining the backward compatible core 
(of up to 5.1 channels) with up to 32 additional channels. The extended channels are compressed using the same 
technology as the core audio channels. The audio data representing these extension channels may be included either into 
a core substream or into an extension substream. The XXCH data within the core substream is automatically ignored by 
all DTS legacy decoders but can be decoded by DTS-HD decoders. 

The frame of XXCH data is divided into a frame header and up to four channel sets. Each channel set has its own 
channel set header as shown in Figure 6-1 1. The CRC word is included at the end of frame header to allow detection of 
errors in the frame header data. In addition, the CRC words may be included at the end of each channel set header to 
allow detection of errors in the channel set header data. 
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Figure 6-1 1 : XXCH Data Frame Structure 

The channel set data is furthermore subdivided into the subframes. Each subframe consists of a subframe header and the 
audio data. The audio data is organized in subsubframes as shown in Figure 6-12. The number of subframes is the same 
for all channel sets and is equal to the number of subframes in the core frame. Similarly the number of 
subsubframes is the same for all subframes of all channel sets and is equal to the number of subsubframes within each 
subframe of the core frame. In other words the subframe and subsubframe partitioning within the XXCH frame follows 
the partitioning present in the core frame. 
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Figure 6-12: Channel Set Data Structure 



6.4.1 XXCH Frame Header 



Table 6-21 : XXCH Frame Header Structure 



XXCH Frame Header Syntax 


Size (Bits) 


SYNCXXCh = ExtractBits (32) ; 
nuHeaderSizeXXCh = ExtractBits ( 6) +1 ; 

bCRCPresent4ChSetHeaderXXCh = (ExtractBits ( 1 ) ==1 ) ? true : false; 

nuBits4SpkrMaskXXCh = ExtractBits ( 5 ) +1 ; 

nuNumChSetsInXXCh = ExtractBits ( 2 ) +1 ; 

for (nChSet=0; nChSet < nuNumChSetsInXXCh; nChSet ++) 

pnuChSetFsizeXXCh [nChSet] = ExtractBits ( 14 ) +1 ; 
nuCoreSpkrActivityMask = ExtractBits (nuBits4SpkrMaskXXCh) ; 
ReservedHeaderXXCh = ExtractBits (...) ; 


32 

6 

1 

5 

2 

14*nuNumChSetslnXXCh 
nuBits4SpkrMaskXXCh 
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XXCH Frame Header Syntax 


Size (Bits) 


ByteAlignHeaderXXCh = ExtractBits ( ... 7); 
nCRC16HeaderXXCh = ExtractBits ( 16 ) ; 


0...7 
16 



SYNCXXCh (XXCH Sync Word) 

The DWORD aligned XXCH synchronization word has value 0x47004a03. 
nuHeaderSizeXXCh (XXCH frame header length) 

nuHeaderSizeXXCh is the size of the header in bytes from the SYNCXXCh to the nCRC16HeaderXXCh inclusive. 
This value determines the location of the first channel set header. This marker also designates the end of the field 
nCRC16HeaderXXCh and allows quick location of the checksum at byte position nuHeaderSizeXXCh-2. 

bCRCPresent4ChSetHeaderXXCh (CRC presence flag for channel set header ) 

When bCRCPresent4ChSetHeaderXXCh = true the 16-bit CRC word for the channel set header is present at the end of 
each channel set header. 

nuBits4SpkrMaskXXCh (Number of bits for loudspeaker mask) 

nuBits4SpkrMaskXXCh indicates how many bits are used for packing the loudspeaker layout mask 
nuXXChSpkrLayoutMask (packed using nuBits4SpkrMaskXXCh - 6 bits) and the nDownMixChMapMask (packed 
using nuBits4SpkrMaskXXCh bits). 

nuNumChSetsInXXCh (Number of Channel Sets) 

All channels within the XXCH extension are organized in individually decodable channel sets. The 
nNumChSetalnXXCh is the number of channel sets that are present in XXCH. 

pnuChSetFsizeXXCh (Channel Set Data Byte Size) 

The pnuChSetFsizeXXCh[nChSet] indicates the total number of data bytes in nChSet channel set of the XXCH frame. 
Starting from the SYNCXXCh and using the cumulative sum of nuHeaderSizeXXCh and the pnuChSetFsizeXXCh[k] 
(over all channel sets k=0, . . . nChSet -1) as offset, decoder may traverse to the beginning of channel set header data in 
the channel set nChSet. 

nuCoreSpkrActivityMask (Core Loudspeaker Activity Mask) 

The nuCoreSpkrActivityMask indicates which of the pre-defined loudspeaker positions apply to the audio channels 
encoded in the core portion of DTS-HD stream. In nominal case the core channel layout as indicated by the AMODE 
core parameter will agree with the layout indicated by the nuCoreSpkrActivityMask. However there may be cases 
where the two layouts are not identical and: 

• the information in the AMODE field of the core stream is used to determine the intended speaker layout for 
5.1 playback; 

• the information in nuCoreSpkrActivityMask and nuXXChSpkrLayoutMask when combined together is used to 
determine the intended speaker layout for 8 channel playback. 

As an example consider the channel layout, that is coded in DTS-HD stream, to be C, L, R, LFEj, L ss , R ss , L sr and R sr 
In this case channels are organized in two channels sets with the C, L, R, LFEj, L ss , R ss , being in the first (core) channel 
set and the L sr and R sr being in the second (XXCH) channel set. During the encode process the 7.1 to 5.1 down-mix will 
be embedded such that the L sr and R sr channels are mixed into the L ss and R ss channels. The resulting mixed channels 

are encoded in the core stream as Ls and Rs channels (AMODE=9 > C, L, R, Ls and Rs layout). A 5.1 decoder uses this 
AMODE to configure its decoded outputs to C, L, R, Ls and Rs layout. On the other hand a 7.1 decoder ignores the 
AMODE information from the core stream and uses instead the nuCoreSpkrActivityMask (C, L, R, LFEj, L ss and R ss ) 
and the nuXXChSpkrLayoutMask (L sr and R sr ) from the XXCh stream to get the original 7. 1 speaker layout (C, L, R, 
LFEp L ss , R sp L sr and R sr ) and configures its outputs accordingly. 

Each core encoded channel or channel pair, depending on the corresponding speaker position(s), sets the appropriate bit 
in a loudspeaker activity mask. Predetermined loudspeaker positions are described in Table 6-22. For example, 
nuSpkrActivityMask = OxF indicates activity of C, L, R, L s , R s and LFE l loudspeakers. 
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Table 6-22: Loudspeaker Masks - 
nuXXChSpkrLayoutMask/nuCoreSpkrActivityMask/DownMixChMapMask 



M/if4f!nn 

iNOiciiion 


Location Description 
(Approximate angle in horizontal plane) 


Corresponding bit in 
nuXXChSpkrLayoutMask 


Corresponding bit in 
nuL»oreopKrACiiviiyiviasK/ 
D o w n M i x C h M a p M a s k 


c 


f^pntrp in front of li^tpnpr (0\ 

wci i Li c hi 1 1 u i i l ui no lc i lui lul 


N/A 


nyonoooooi 

UAUUUUUUU 1 


L 


1 pft in frnnt (-V)) 

L-CIL III IIUIIL \ JU j 


N/A 


0x00000002 

UAUUUUUUUl. 


R 


Rinht in front (10) 

1 11 yl 11 III 1 1 UI 11 y\J\J J 


N/A 


0x00000004 


Ls 


1 pft ci irroi inH on ^irlp in rppr f-1 1 Ci\ 

1 C 11 uUI 1 UUI IU UI 1 JlUC III ICul \ 1 1 \J I 


N/A 


0x00000008 

UAUUUUUUUU 


Rs 
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Cs 
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0x00000040 

UAUUUUUUUU 


0x00000040 
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UAUUUUUUUU 
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0x10000000-0x80000000 


0x10000000-0x80000000 



ReservedHeaderXXCh (Reserved) 

This field is reserved for additional XXCH header information. The decoder shall assume that this field is present and 
of unspecified duration. Therefore, in order to continue unpacking the stream, the decoder shall skip over this field 
using the XXCH header start pointer and the XXCH header size nuHeaderSizeXXCh. 

ByteAlignHeaderXXCh (Pad to BYTE boundary) 

This field ensures that the CRC16 field that follows is aligned to a byte boundary to allow fast table based CRC16 
calculation. Append '0's until bit position is a multiple of 8. 

nCRC16HeaderXXCh (CRC16 of XXCH frame header) 

It represents the 16-bit CRC check word of the entire XXCH header from position nuHeaderSizeXXCh to 
ByteAlignHeaderXXCh inclusive. 
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.4.2 XXCH Channel Set Header 



Table 6-23: XXCh Channel Set Header Structure 



XXCH Channel Set Header Syntax 



Size (Bits) 



nuXXChChSetHeaderSize = ExtractBits ( 7 ) +1 ; 
nuChInChSetXXCh= ExtractBits ( 3 ) + 1; 

nuXXChSpkrLayoutMask = ExtractBits (nuBits4SpkrMaskXXCh-6) << 6; 
bDownMixCoef f CodeEmbedded = (ExtractBits ( 1 ) == 1) ? true : false; 
if (bDownMixCoef f CodeEmbedded) { 

bDownMixEmbedded = (ExtractBits ( 1 ) == 1) ? true : false; 
nDmixScaleFactor = ExtractBits ( 6 ) ; 
for (nCh=0; nCh<nuChInChSet ; nCh++) 

DownMixChMapMask [nCh] = ExtractBits (nuBits4SpkrMaskXXCh) ; 
for (nCh=0; nCh<nuChInChSet ; nCh++) { 

for (n=0, nCoef=0; n<nuBits4SpkrMaskXXCh; n++, nCoef ++) { 
if ( (DownMixChMapMask [nCh] >>n) & 0x1 ) 

DownMixCoef f s [nCh] [nCoef] = ExtractBits ( 7 ) ; 

} 

} 

} // End condition on bDownMixCoef f CodeEmbedded 
for (nCh=0; nCh<nuChInChSet ; nCh++) 

SUBS [nCh] = ExtractBits (5) ; 
for (nCh=0; nCh<nuChInChSet ; nCh++) 

VQSUB[nCh] = ExtractBits ( 5 ) ; 
for (nCh=0; nCh<nuChInChSet ; nCh++) 

JOINX[nCh] = ExtractBits (3) ; 
for (nCh=0; nCh<nuChInChSet ; nCh++) 

THUFF [nCh] = ExtractBits ( 2 ) ; 
for (nCh=0; nCh<nuChInChSet ; nCh++) 

SHUFF[nCh] = ExtractBits ( 3 ) ; 
for (nCh=0; nCh<nuChInChSet ; nCh++) 

BHUFF [nCh] = ExtractBits ( 3 ) ; 
// ABITS=1: 
n=0; 

for (ch=0; ch<nuChInChSetXXCh; ch++) 

SEL[ch] [n] = ExtractBits (1) ; 
// ABITS = 2 to 5: 
for (n=l; n<5; n++) 

for (ch=0; ch<nuChInChSetXXCh; ch++) 
SEL [ch] [n] = ExtractBits (2) ; 
// ABITS = 6 to 10: 
for (n=5; n<10; n++) 

for (ch=0; ch<nuChInChSetXXCh; ch++) 
SEL[ch] [n] = ExtractBits (3) ; 
// ABITS = 11 to 2G: 
for (n=10; n<26; n++) 

for (ch=0; ch<nuChInChSetXXCh; ch++) 

SEL[ch] [n] =0; // Not transmitted, set to zero. 
// ABITS = 1 : 
n = 0; 

for (ch=0; ch<nuChInChSetXXCh; ch++) { 
if ( SEL [ch] [n] == ) 

arADJ[ch] [n] = Adj Table [ExtractBits ( 2 )] ; 

} 

// ABITS = 2 to 5: 
for (n=l; n<5; n++) { 

for (ch=0; ch<nuChInChSetXXCh; ch++) { 
if ( SEL [ch] [n] < 3 ) 

arADJ[ch] [n] = Adj Table [ExtractBits ( 2 )] ; 

} 

// ABITS = 6 to 10: 
for (n=5; n<10; n++) { 

for (ch=0; ch<nuChInChSetXXCh; ch++) 
if ( SEL [ch] [n] < 7 ) 

arADJ[ch] [n] = Adj Table [ExtractBits ( 2 )] ; 

} 

ReservedHeaderChSet = ExtractBits (...) ; 
ByteAlignHeaderChSet = ExtractBits ( ... 
if (bCRCPresent4ChSeHeader ) 

nCRC16HeaderChSet = ExtractBits ( 1G ) 



7 

3 

nuBits4SpkrMaskXXCh-6 
1 

1 

6 

nuBits4SpkrMaskXXCh 



7) ; 



Variable bits 



0...7 
16 
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nuXXChChSetHeaderSize (Channel set header length) 

The size of the channel set header in bytes from the nuXXChChSetHeaderSize to either ByteAlignHeaderChSet (when 
bCRCPresent4ChSetHeaderXXCh = false) or nCRC16HeaderChSet (when bCRCPresent4ChSetHeaderXXCh = true) 
inclusive. This value determines the beginning of the channel set audio data. In case when the 
bCRCPresent4ChSetHeaderXXCh = true this marker also designates the end of the field nCRC16HeaderChSet and 
allows quick location of the checksum at byte position nuXXChChSetHeaderSize -2. 

nuChlnChSetXXCh (Number of channels in a channel set) 

Indicates the number of channels in the channel set. 
nuXXChSpkrLayoutMask (Loudspeaker Layout Mask) 

The nuXXChSpkrLayoutMask indicates which of the pre-defined loudspeaker positions apply to the audio channels 
encoded in a XXCh channel set. Each encoded channel/channel pair, depending on the corresponding speaker 
position/positions, sets the appropriate bit in a loudspeaker layout mask. Predetermined loudspeaker positions are 
described in Table 6-22. For example nuXXChSpkrLayoutMask =0x4040 indicates activity of Cs and Ch loudspeakers. 
This is in addition to the speakers encoded in the core portion of the DTS-HD stream. 

bDownMixCoeffCodeEmbedded (Downmix coefficients present in stream) 

If true it indicates that a matrix of downmix coefficients has been defined and is embedded in the stream. 
bDownMixEmbedded (Downmix already performed by encoder) 

Present only if bDownMixCoeffEmbedded is true. When bDownMixEmbedded=true this indicates to the decoder that 
on the encode side, audio in the channels of the current channel set (nChSet) have been down mixed to the core 
channels and to the channels in the lower indexed channel sets (ChSetIndex<nChSet). After decoding the current 
channel set the above mentioned encoder downmix operation needs to be undone in the decoder. If 
bDownMixEmbedded=false the encoder did not perform the downmixing operation on the current channel set. 

nDmixScaleFactor (Downmix scale factor) 

Present only if bDownMixCoeffEmbedded is true. The nDmixScaleFactor is a scaling coefficient that prevents an 
overflow and is applied to all output channels of the downmix. In the case when bDownMixCoeffEmbedded is true the 
nDmixScaleFactor has already been applied, on the encode side, to the output downmix channels. 

DownMixChMapMask (Downmix channel mapping mask) 

Present only if bDownMixCoeffEmbedded is true. Each channel of the current channel set may be mapped to any of the 
core and the channels in the lower indexed channel sets. For each channel (nCh) of the current channel set, the 
DownMixChMapMask[nCh] indicates the core channels and the channels in the lower indexed channel sets to which 
the channel nCh is going to be downmixed into. 

The DownMixChMapMask[nCh] has a dedicated bit (according to Table 6-22) for each of the core channels and each 
of the channels in the lower indexed channels sets. A channel nCh of the current channel set is mapped exclusively to 
those channels that have their bits in DownMixChMapMask [nCh] set to "1". Mapping to all other channels assumes 
downmix coefficient equal to -Infinity. 

DownMixCoeffs (Downmix coefficients) 

Present only if bDownMixCoeffEmbedded is true. For each channel (nCh) of the current channel set there is one 
downmix coefficient for each bit that is set to " 1 " in the corresponding channel mapping mask 
DownMixChMapMask[nCh]. These coefficients are used to multiply each sample of the channel nCh and add the 
product to the corresponding sample of the channel indicated by the DownMixChMapMask[nCh]. 

Coding of the downmix coefficients is described in clause D. 1 1 . 

SUBS (Subband Activity Count) 

This field indicates that there are nSUBS[ch] = SUBS[ch]+2 active subbands in the audio channel ch. Samples in 
subbands above nSUBS[ch] are zero, provided that intensity coding in that subband is disabled. 
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VQSUB (High Frequency VQ Start Subband) 

This field indicates that high frequency samples starting from subband nVQSUB[ch]=VQSUB[ch]+l are VQ encoded. 
High frequency VQ is used only for high frequency subbands, but it may go down to low frequency subbands for such 
audio episodes as silence. In case of insufficient MIPS, the VQs for the highest frequency subbands may be ignored 
without causing audible distortion. 

JOINX (Joint Intensity Coding Index) 

This field indicates if joint intensity coding is enabled for channel ch and which audio channel is the source channel 
from which channel ch will copy subband samples (see Table 5-22). It is assumed that the source channel index is 
smaller than that of the intensity channel. 

THUFF (Transient Mode Code Book) 

This field indicates which Huffman codebook was used to encode the transient mode data TMODE (see Table 5-23). 
SHUFF (Scale Factor Code Book) 

The scale factors of a channel are quantized nonlinearly using either a 6-bit (64-level, 2,2 dB per step) or a 7-bit 
(128-level, 1,1 dB per step) square root" table, depending on the application. The quantization indexes may be further 
compressed by one of the five Huffman codes and this information is transmitted to the decoder by SHUFF[ch] (see 
Table 5-24). 

BHUFF (Bit Allocation Quantizer Select) 

This field indicates the codebook that was used to encode the bit allocation index ABITS (see Table 5-25). 
SEL (Quantization Index Codebook Select) 

After subband samples are quantized using a mid-tread linear quantizer, the quantization indexes may be further 
encoded using either entropy (Huffman) or block coding in order to reduce bit rate. Therefore, the subband samples 
may appear in the bitstream as plain quantization indexes (no further encoding), entropy (Huffman) codes, or block 
codes. For channel ch, the selection of a particular codebook for a mid-tread linear quantizer indexed by ABITS [ch] is 
transmitted to the decoder as SEL[ch] [ABITS [ch]]. No SEL is transmitted for ABITS[ch]>l 1, because no further 
encoding is used for those quantizers. The decoder can find out the particular codebook that was used using ABITS [ch] 
and SEL[ch] [ABITS [ch]] to look up the table (see Table 5-26). 

ADJ (Scale Factor Adjustment Index) 

A scale factor adjustment index is transmitted whenever a SEL value indicates a Huffman codebook. This index points 
to the adjustment values shown in the table (see Table 5-27). This adjustment value should be multiplied by the scale 
factor (SCALE). 

ReservedHeaderChSet (Reserved) 

This field is reserved for additional channel set header information. The decoder shall assume that this field is present 
and of unspecified length. Therefore, in order to continue unpacking the stream, the decoder shall skip over this field 
using the channel set header start pointer and the channel set header size nuXXChChSetHeaderSize. 

ByteAlignHeaderChSet (Pad to BYTE boundary) 

This field ensures that the CRC16 field that follows is aligned to a byte boundary to allow fast table based CRC16 
calculation. Append '0's until bit position is a multiple of 8. 

nCRC16HeaderChSet (CRC16 of channel set header) 

This field is present only if the bCRCPresent4ChSetHeaderXXCh is true. It represents the 16-bit CRC check word of 
the entire channel set header from positions nuXXChChSetHeaderSize to ByteAlignHeaderChSet inclusive. 
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L2.1 



Unpack Subframes 



Table 6-24: XXCH Unpack Subframes 



XXCH Unpack Subframes 



Size (Bits) 



for <ch=0; ch<nuChInChSetXXCh; ch++) 
for (n=0; n<nSUBS [ch] ; n++) 

PMODE[ch] [n] = ExtractBits { 1 ) ; 
int nVQIndex; 

for (ch=0; ch<nuChInChSetXXCh; ch++) { 
for (n=0; n<nSUBS [ch] ; n++) 

if { PMODE [ch] [n] >0 ) { // Transmitted only when AD PCM active 
// Extract the VQindex 
nVQIndex = ExtractBits { 12 ) ; 

// Look up the VQ table for prediction coefficients. 
ADPCMCoeffVQ. Lookup (nVQIndex, PVQ [ch] [n] ) ; // 4 coefficients 

} 

} 

for (ch=0; ch<nuChInChSetXXCh; ch++) { 
// BHUFF tells which codebook was used 
nQSelect = BHUFF [ch] ; 

// Use this codebook to decode the bitstream for ABITS [ch] [n] 
for (n=0; n<nVQSUB [ch] ; n++) // Not for VQ encoded subbands . 
QABITS. ppQ [nQSelect] - >InverseQ ( InputFrame , ABITS [ch] [n] ) ; 

} 

// Always assume no transition unless told 
for <ch=0; ch<nuChInChSetXXCh; ch++) 

for (n=0; n<NumSubband; n++) 

TMODE [ch] [n] = ; 
// Decode TMODE [ch] [n] 

if { nSSC>l ) {// Transient possible only if more than one sub- sub- frame . 

for (ch=0; ch<nuChInChSetXXCh; ch++) { 
// TMODE [ch] [n] is encoded by a codebook indexed by THUFF [ch] 
nQSelect = THUFF [ch] ; 

for (n=0; n<nVQSUB [ch] ; n++) // No VQ encoded subbands 

if ( ABITS [ch] [n] >0 ) { // Present only if bits allocated 
// Use codebook nQSelect to decode TMODE from the bitstream 
QTMODE .ppQ [nQSelect] - >InverseQ ( InputFrame , TMODE [ch] [n] ) 

Scale Factors 

for (ch=0; ch<nuChInChSetXXCh; ch++) { 
// Clear SCALES 

for (n=0; n<NumSubband; n++) { 
SCALES [ch] [n] [0] = ; 
SCALES [ch] [n] [1] = ; 

} 

// SHUFF indicates which codebook was used to encode SCALES 
nQSelect = SHUFF [ch] ; 

// Select the root square table (SCALES were nonlinearly 

// quantized) . 

if ( nQSelect == S ) 

pScaleTable 
else 

pScaleTable = &RMS6Bit ; 

// 

// Clear accumulation (if Huffman code was used, the difference 
// of SCALES was encoded) . 

// 

nScaleSum = 0; 

// 

// Extract SCALES for Subbands up to VQSUB [ch] 

// 

for (n=0; n<nVQSUB [ch] ; n++) 

if { ABITS [ch] [n] >0 ) { // Not present if no bit allocated 

// 

// First scale factor 

// 

// Use the (Huffman) code indicated by nQSelect to decode 
// the quantization index of SCALES from the bitstream 
QSCALES. ppQ [nQSelect] ->InverseQ( InputFrame, nScale) ; 
// Take care of difference encoding 

if ( nQSelect < 5 ) // Huffman encoded, nScale is the difference 
nScaleSum += nScale; // of the quantization indexes of SCALES. 

else // Otherwise, nScale is the quantization 
nScaleSum = nScale; // level of SCALES. 

// Look up SCALES from the root square table 



1 bit per 
subband 



1 2 bits per 
occurrence 



Variable bits 



Variable bits 



Variable Bits 



&RMS7Bit; // 7-bit root square table 
// 6-bit root square table 
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XXCH Unpack Subframes 



Size (Bits) 



pScaleTable->LookUp (nScaleSum, SCALES [ch] [n] [0]) 

// 

// Two scale factors transmitted if there is a transient 

// 

if (TMODE [ch] [n] >0) { 

// Use the (Huffman) code indicated by nQSelect to decode 
// the quantization index of SCALES from the bitstream 
QSCALES .ppQ [nQSelect] - >InverseQ ( InputFrame , nScale) ; 
// Take care of difference encoding 

if ( nQSelect < 5 ) // Huffman encoded, nScale is the 

nScaleSum += nScale; // of SCALES, 
else // Otherwise, nScale is SCALES 

nScaleSum = nScale; // itself. 
/ / Look up SCALES from the root square table 
pScaleTable->LookUp (nScaleSum, SCALES [ch] [n] [1] ) ; 

} 

} 

// 

// High frequency VQ subbands 

// 

for (n=nVQSUB [ch] ; n<nSUBS [ch] ; n+ + ) { 

// Use the code book indicated by nQSelect to decode 
// the quantization index of SCALES from the bitstream 
QSCALES .ppQ [nQSelect] - >InverseQ (InputFrame, nScale) ; 
// Take care of difference encoding 

if ( nQSelect < 5 ) // Huffman encoded, nScale is the 

nScaleSum += nScale; // of SCALES, 
else // Otherwise, nScale is SCALES 

nScaleSum = nScale; // itself. 
// Look up SCALES from the root square table 
pScaleTable->LookUp (nScaleSum, SCALES [ch] [n] [0]); 

} 

for (ch=0; ch<nuChInChSetXXCh; ch++) 

if ( JOINX[ch]>0 ) // Transmitted only if joint subband coding enabled. 
JOIN_SHUFF [ch] = ExtractBits ( 3 ) ; 
int nSourceCh; 

for (ch=0; ch<nuChInChSetXXCh; ch++) { 

if ( JOINX[ch]>0 ) { // Only if joint subband coding enabled. 

nSourceCh = JOINX[ch]-l; // Get source channel. JOINX counts 

// channels as 1,2,3,4,5, so minus 1. 

nQSelect = JOIN_SHUFF [ch] ; // Select code book. 

for (n=nSUBS [ch] ; n<nSUBS [nSourceCh] ; n++) { 

// Use the code book indicated by nQSelect to decode 

// the quantization index of JOIN_SCALES 

QSCALES. ppQ [nQSelect] ->InverseQ( InputFrame, nJScale) ; 

// Bias by 64 

nJScale = nJScale + 64; 

// Look up JOIN_SCALES from the joint scale table 
JScaleTbl . Lookup (nJScale , J0IN_SCALES [ch] [n] ) ; 

} 

} 

if ( CPF==1 ) // Present only if CPF=1. 
SICRC = ExtractBits (16) ; 



3 bits per 
channel 



Variable bits 



6.4.2.2 Side Information 

Prediction Mode (PMODE) 

PMODE[ch][n]=l (1 bit per subband) indicates that ADPCM prediction is used (active) for subband n of extension 
audio channel [ch] and PMODE[ch][n]=0 otherwise. ADPCM shall be extracted from the bitstream for all subbands, 
but ADPCM reconstruction can be limited to the lowest 20 subbands if DSP does not have enough MIPS. 

Prediction Coefficients VQ Address (PVQ) 

This field (12 bits per active occurrence) indexes to the vector code book to get the ADPCM prediction coefficients, 
(see clause D.10.2). It is transmitted only for subbands whose ADPCM is active. 
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Bit Allocation Index (ABITS) 

ABITS[ch][n] (variable bits) is the index to the mid -tread linear quantizer that was used to quantize the subband 
samples for the n th subband of channel ch. ABITS [ch][n] may be transmitted as either a 4-bit or 5-bit word. When 
ABITS is encoded in a 4-bit word, it may be further encoded using one of the five Huffman codes. This encoding is the 
same for all subbands of each channel and is conveyed by BHUFF as shown in Table 5-26. There is no need to allocate 
bits for the high frequency subbands because they are encoded using VQ. 

Transition Mode (TMODE) 

TMODE[ch][n] (variable bits) indicates if there is a transient inside a subframe (subband analysis window) for subband 
n of channel ch. If there is a transient (TMODE[ch][n]>0), it further indicates that the transition occurred in 
subsubframe (subband analysis subwindow) TMODE[ch][n] + 1. TMODE [ch][n] is encoded by one of the four 
Huffman codes and the selection of which is conveyed by THUFF (see Table 6-7). The decoder assumes that there is no 
transition (TMODE[ch][n]=0) for all subbands of all channels unless it is told otherwise by the bitstream. Transient 
does not occur in the following situations, so TMODE is not transmitted: 

• Only one subsubframe within the current subframe. This is because the time resolution of transient analysis is 
a subsubframe (subband analysis subwindow). 

• VQ encoded high frequency subbands. If there is a transient for a subband, it would not have been VQ 
encoded. 

• Subbands without bit allocation. If there is no need to allocate bits for a subband, there is no need to care about 
transient for it. 

Scale Factors (SCALES) 

One scale factor (variable bits) is transmitted for subbands without transient. Otherwise two are transmitted, one for the 
episode before the transient and the other for after the transient. The quantization indexes of the scale factors may be 
encoded by Huffman code as shown in Table 5-24. If this is the case, they are difference-encoded before Huffman 
coding. The scale factors are finally obtained by using the quantization indexes to look up either the 6-bit or 7-bit square 
root quantization table according to Table 5-24. 

Joint Subband Scale Factor Codebook Select (JOIN SHUFF) 

If joint subband coding is enabled (JOINX[ch]>0), JOIN_SHUFF[ch] (3 bits per channel) selects which code book was 
used to encode the scale factors (JOIN_SCALES) which will be used when copying subband samples from the source 
channel to the current channel ch. For now, these scale factors are encoded in exactly the same way as that for 
SCALES, so use Table 5-24 to look up the codebook. 

Scale Factors for Joint Subband Coding (JOIN SCALES) 

The scale factors (variable bits) are used to scale the subband samples copied from the source channel (JOINX[ch]-l) to 
the current channel. The index of the scale factor is encoded using the code book indexed by JOIN SHUFF[ch]. After 
this index is decoded, it is used to look up the table in clause D.3 in to get the scale factor. No transient is permitted for 
jointly encoded subbands, so a single scale factor is included. The joint subbands start from the nSUBS of the current 
channel until the nSUBS of the source channel. 

Side Information CRC Check Word (SICRC) 

If CPF = 1 then SICRC shall be extracted from the bitstream. The CRC value test shall not be applied. 

6.4.2.3 Data Arrays 



Table 6-25: XXCH - Data Arrays 



XXCH Data Arrays 


Size (Bits) 


for (ch=0; ch<nPCHS; ch++) 

for (n=nVQSUB [ch] ; n<nSUBS [ch] ; n+ + ) { 

// Extract the VQ address from the bitstream 
nVQIndex = ExtractBits { 10 ) ; 

// Look up the VQ code book for 32 subband samples. 
HFreqVQ. Lookup (nVQIndex, HFREQ [ch] [n] ) 
// Scale and take the samples 


10 bits per subband 
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XXCH Data Arrays 



Size (Bits) 



Scale = (real) SCALES [ch] [n] [0]; // Get the scale factor 
for (m=0; m<nSSC*8; m++, nSample++) 

aPrmCh[ch] .aSubband[n] .raSample[m] = rScale*HFREQ [ch] [n] [m] 



AUDIO Section 



Variable bits 



// Select quantization step size table 
if { RATE == Oxlf ) 

pStepSizeTable = SStepSizeLossLess ; // Lossless quantization 
else 

pStepSizeTable = SStepSizeLossy ; // Lossy 

// 

// Unpack the subband samples 

for (nSub- sub- f rame=0 ; nSub- sub- frame <nSSC ; nSub- sub- f rame++ ) { 
for (ch=0; ch<nPCHS; ch++) 

for (n=0; n<nVQSUB [ch] ; n++) { // Not high frequency VQ subbands 

// 

// Select the mid-tread linear quantizer 

// 

nABITS = ABITS [ch] [n] ; // Select the mid-tread quantizer 
pCQGroup = kpCQGroupAUDIO [nABITS-1] ; // Select the group of 
// code books corresponding to the 
// the mid-tread linear quantizer. 

nNumQ = pCQGroupAUDIO [nABITS- 1] . nNumQ- 1 ; // Number of code 
// books in this group 

// 

// Determine quantization index code book and its type 

// 

// Select quantization index code book 
nSEL = SEL[ch] [nABITS-1] ; 
// Determine its type 

nQType =1; // Assume Huffman type by default 
if ( nSEL==nNumQ ) { // Not Huffman type 
if ( nABITS<=7 ) 

nQType =3; // Block code 

else 

nQType = 2; // No further encoding 

} 

if ( nABITS==0 ) // No bits allocated 
nQType = ; 

// 

// Extract bits from the bitstream 

// 

switch ( nQType ) { 
case : //No bits allocated 
for (m=0; m<8; m++) 

AUDIO [m] = ; 
break; 

case 1 : // Huffman code 
for (m=0; m<8; m++) 

pCQGroup- >ppQ [nSEL] - >InverseQ ( InputFrame , AUDIO [m] ) ; 
break; 

case 2 : // No further encoding 
for (m=0; m<8; m++) { 

// Extract quantization index from the bitstream 
pCQGroup- >ppQ [nSEL] - >InverseQ ( InputFrame , nCode) 
// Take care of 2's compliment 

AUDIO[m] = pCQGroup- >ppQ [nSEL] - >SignExtension (nCode) ; 

} 

break; 

case 3 : // Block code 
pCBQ = SpCBlockQ [nABITS-1] ; // Select block code book 
m = ; 

for (nBlock=0; nBlock<2 ; nBlock++) { 

// Extract the block code index from the bitstream 
pCQGroup- >ppQ [nSEL] - >InverseQ ( InputFrame , nCode) 
// Look up 4 samples from the block code book 
pCBQ->LookUp (nCode, &AUDIO [m] ) 
m += 4 ; 

} 

break; 

default: // Undefined 
printf ( "ERROR: Unknown AUDIO quantization index code book."); 

} 

} 

// 

// Account for quantization step size and scale factor 
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XXCH Data Arrays 


Size (Bits) 


// 

// Look up quantization step size 
nABITS = ABITS [ch] [n] ; 

pStepSizeTable- >LookUp (nABITS , rStepSize) ; 

// Identify transient location 

nTmode = TMODE [ch] [n] ; 

if { nTmode == ) // No transient 

nTmode = nSSC; 
// Determine proper scale factor 
if (nSub- sub- f rame<nTmode ) // Pre-transient 

rScale = rStepSize * SCALES [ch] [n] [0]; // Use first scale factor 
else // After-transient 

rScale = rStepSize * SCALES [ch] [n] [1]; // Use second scale factor 
// Adjustmemt of scale factor 

rScale *= arADJ[ch] [SEL [ch] [nABITS-1] ] ; // arADJ [ ] [ ] are assumed 1 

// unless changed by bit 

// stream when SEL indicates 

// Huffman code. 

// Scale the samples 

nSample = 8*nSub-sub-f rame; // Set sample index 
for (m=0; m<8; m++, nSample++) 

aPrmCh [ch] . aSubband [n] . aSample [nSample] = rScale*AUDIO [m] ; 

// 

// Inverse AD PCM 

// 

if { PMODE [ch] [n] != ) // Only when prediction mode is on. 
aPrmCh[ch] . aSubband [n] . InverseADPCM ( ) ; 

// 

// Check for DSYNC 

if { (nSub-sub-frame== (nSSC-1) ) || (ASPF==1) ) { 
DSYNC = ExtractBits (16) ; 
if { DSYNC != Oxffff ) 

printf ( "DSYNC error at end of sub- sub- frame #%d" , nSub- sub- frame ) ; 

} 

} 

} 





VQ Encoded High Frequency Subbands (HFREQ) 

At low bit rates, some high frequency subbands are encoded using vector quantization (VQ). The code book is given in 
clause D.10.2. Each vector from this code book consists of 32 subband samples, corresponding to the maximum 
possible subframe (4 normal subsubframes): 

4 subsubframe x 8 samples/subsubframe = 32 samples: 

If the current subframe is short of 32 samples, the remaining samples are padded with zeros and then vector- s, it looks 
up the vector code book to get the 32 samples. But the decoder will only pick nSSCx8 out of the 32 samples and scale 
them with the scale factor SCALES. 

Audio Data (AUDIO) 

The audio data are grouped as nSSC subsubframes, each consisting of eight samples for each subband. Each sample 
was quantized by a mid -tread linear quantizer indexed by ABITS. The resultant quantization index may be further 
encoded by either a Huffman or block code. If it is not, it is included in the bitstream as 2's compliment. All this 
information is indicated by SEL. The (ABITS, SEL) pair then tells how the subband samples should be extracted from 
the bitstream (see Table 6-8). 

The resultant subband samples are then compensated by their respective quantization step sizes and scale factors. 
Special care is to be paid to possible transient in the subframe. If a transient is flagged by TMODE, one scale factor is 
used for samples before the transient and the other one for the after the transient. 

For some of the subbands that are ADPCM encoded, the samples of these subbands thus far obtained are actually the 
difference signals. Their real values shall be recovered through a reverse ADPCM process: 

At the end of each subsubframe there may be a synchronization check word DSYNC = Oxffff depending on the flag 
ASPF in the frame header, but there shall be at least a DSYNC at the end of each subframe. 
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ReservedChSet (Reserved) 

This field is reserved for additional channel set information. The decoder shall assume that this field is present and of 
unspecified duration. In order to continue unpacking the stream, the decoder shall skip over this field by navigating 
nuXXChChSetHeaderSize bytes from the beginning of the XXCH header. 

ByteAlignChSet (Pad to BYTE boundary) 

This field insures that the XXCH extension ends on a byte boundary. to 7 bits which are set to are added to force 
byte boundary alignment. 



7 DTS Extension Substream Construction 

Building upon the foundation established by the original DTS codec technology, the DTS extension substream formats 
have added a modular architecture to include support for advanced features such as alternate channel maps, the ability 
to include replacement channels and metadata to permit authored control in the play back environment of how these 
streams are selected or combined. Additionally, the ability to support resolution enhancements and a new coding profile 
designed for higher efficiency, bring together a new comprehensive suite of technologies into one package. 

This expanded coding system, of which an implementation is currently commercialized as DTS-HD™, can support 
compatibility with pre-existing audio decoding systems through the presence of the core substream, enhancing 
performance of the core substream, or attain higher efficiency or a combination of efficiency and performance when 
direct compatibility with the original DTS core is not required. 



7.1 Relationship Between Core and Extension Substreams 

Organization into the core substream and the extension substreams is illustrated in Figure 7-1. 
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Figure 7-1 : Organization of the DTS-HD stream 



The core substream carries a DTS Coherent Acoustics component which is referred to as the core and may carry one 
extension as indicated in clause 5. 

An extension substream may consist of one or more of these components: 

• Coding components that represent the extensions to the core coding component present in the core substream. 
The use of these extensions enhances an audio presentation by providing features such as higher resolution and 
additional channels. 

• Additional audio assets that are mixed with the audio asset represented by the core substream and/or other 
assets from the extension substreams, creating a new audio presentation. 

• Stand-alone audio assets that represent an entire audio presentation. 

When compatibility to previously deployed DTS decoders is not required, the audio stream may consist of one or more 
extension substreams. 
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7.2 Audio Presentations and Audio Assets 

An extension substream has an architecture that allows several different audio presentations to be coded within a single 
extension substream. At the first level, each extension substream is organized in up to eight audio assets. Some 
examples are: 

• An integral part of an audio presentation that is mixed with another audio asset to create the complete audio 
presentation. For example, one audio asset may carry the music, effects and dialog while another asset may 
carry a director's commentary. The complete audio presentation is prepared by mixing the director's 
commentary asset with the music, effects and dialog asset. The instructions for mixing are transmitted in the 
associated metadata fields. 



• A set of loudspeaker signal feeds for a corresponding loudspeaker layout, such as, feed for left, right, left 
surround, right surround, centre and LFE loudspeakers in 5.1 loudspeaker layout. 

• A set of signals that describe the sound field but are not actual loudspeaker feeds. The actual loudspeaker feeds 
are derived from these signals using stream-embedded metadata and/or some signal processing, i.e. Ambisonic 
B-format where signals W, X, Y and Z are transmitted and the actual loudspeaker feeds are derived on the 
decoder side using linear mixing equations. 

Note that by default the core substream data belongs to the asset 0. 

The layout of an extension substream is illustrated in Figure 7-2. 
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Figure 7-2: Organization of the Extension Substream 



It is not necessary that all of the audio assets present in an audio stream be active at the same time. In particular, 
different audio presentations are defined by activating specific audio assets. The active assets are combined together as 
instructed by the stream metadata to create a particular audio presentation. For example: 

• Audio asset carries the primary audio presentation. 

• Audio asset 1 carries a director's commentary in English. 

• Audio asset 2 carries a director's commentary in Spanish. 

With this configuration of audio assets, one can define the following three audio presentations: 

• Activate the asset to create the primary audio presentation. 

• Activate the assets and 1 to create the primary audio presentation mixed with the director's commentary in 
English. 

• Activate the assets and 2 to create the primary audio presentation mixed with the director's commentary in 
Spanish. 

Furthermore, an audio presentation may consist of the active assets that are transmitted within different extension 
substreams. In particular, each extension substream can define up to eight different audio presentations. An audio 
presentation that is defined in the extension substream with index nExSS may include any audio asset from all of the 
extension substreams with an index less than or equal to nExSS. 
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For example: 

• Extension substream with index 0: 

Audio asset carries primary audio presentation. 
Audio asset 1 carries a director's commentary in English. 

• Extension substream with index 1 : 

Audio asset carries a director's commentary in Spanish. 

With the configuration of extension substreams and the audio assets within each of them (shown above), the following 
audio presentations can be created: 

• Audio presentations defined in extension substream 0: 

Primary audio presentation by activating the asset of extension substream 0. 

Primary audio + Secondary English by activating the assets and 1 of extension substream 0. 

• Audio presentations defined in extension substream 1 : 

Primary audio + Secondary Spanish by activating the asset of extension substream and asset of 
extension substream 1 . 

Note that different coding components of asset may be either in the core substream or in the extension substream. All 
assets other than asset shall have all their coding components within the same extension substream. 

The organization of an audio asset data is illustrated in Figure 7-3. 
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Figure 7-3: Organization of the Audio Asset Data 

7.2.1 Channel Sets 

In order to achieve scalability, the architecture of the extension substream allows the channels within the coding 
components to be organized into channel sets. Each channel set can be separately extracted and decoded as needed. An 
example of this would be when the intended speaker layout for a channel set, e.g. a 7.1 mix can be encoded in three 
channel sets such that all three of the following apply: 

• Decoding of a channel set produces a stereo downmix. 

• Decoding and combining of the channel sets and 1 produces a 5.1 downmix. 

• Decoding and combining of the channel sets 0, 1 and 2 produces the original 7.1 mix. 
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7.3 Synchronization and Navigation of the Substream 
7.3.1 Synchronization 

The list of possible sync words for core and substream components is provided in Table 7-1. 



Table 7-1 : Sync Words 



DTS_SYNCWORD_CORE 


0x7ffe8001 


DTS_SYNCWORD_XCH 


0x5a5a5a5a 


DTS_SYNCWORD_XXCH 


0x47004a03 


DTS_SYNCWORD_X96 


0x1d95f262 


DTS_SYNCWORD_XBR 


0x655e315e 


DTS_SYNCWORD_LBR 


0x0a801921 


DTS SYNCWORD XLL 


0x41 a29547 


DTS_S YN CWO R D_S U BSTR EAM 


0x64582025 


DTS_SYNCWORD_SUBSTREAM_CORE 


0x02b09261 



DTS_SYNCWORD_SUBSTREAM_CORE, which is located in the extension substream, has a sync word that is 
remapped from 0x7ffe8001 to 0x02b09261. This makes the core substream synchronization simpler and more robust. 
When this backward compatible core component is to be delivered to a legacy DTS decoders, (e.g. via SPDIF), its sync 
word needs to be restored from 0x02b09261 to 0x7ffe8001 prior to the transmission to the legacy decoders. 

7.3.2 Substream Navigation 

Parsing of the substream data depends on the error-free determination of the size (FSIZE) of each component. This 
FSIZE value, when accumulated, helps rapidly locate the start of each component. To ensure error free location, the 
FSIZE values in the substream header are protected by a checksum and, at a minimum, the decoder should attempt to 
decode the legacy core component to ensure some minimal level of audio output. 

Navigation through the individual components within the substream is achieved by extracting and accumulating the 
FSIZE of each component. An index table is gradually constructed allowing a decoder to rapidly locate the start of any 
data stream. After offsetting the pointer from the present position, the checksum field is immediately found and the data 
present at that location is verified for integrity before any further parsing and processing, as shown in Figure 7-4. 

If a component fails a checksum field-and if the substream header was intact-the index table allows the decoder to 
locate the next useful packet of data. However, interdependency between the data may be such that any extra data can 
no longer be utilized. At a minimum, the decoder should always attempt to decode the legacy core where present and 
augment it with relevant component data. 
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Figure 7-4: NAVI Synchronization 



7.4 Parsing Core Substream and Extension Substream Data 

Core substream and extension substream(s) need to be composed in a specific manner when being presented to a 
decoder, as depicted in Figure 7-5. 

When the data is sent to the decoder, both the sync words that define the start of the legacy core substream and the 
extension substream shall be aligned to 32-bit boundaries. The length of the core substream is byte-aligned, so the 
system layer may need to introduce from one to three null bytes between the core substream and the extension 
substream. If any additional substream, such as from an external file, is sent to the decoder, proper SYNC word 
alignment shall be maintained. 
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Figure 7-5: DWORD alignment of substream 



7.4.1 Extension Substream Header 

The extension substream header describes the audio assets that are present in the extension substream and the 
instructions for manipulating the various assets. The syntax of the extension substream header is described in Table 7-2, 
which follows. 

Table 7-2: Extension Substream Header Structure 



Extension Substream Header Structure 



Size (Bits) 



// Extract the sync word 
SYNCEXTSSH = ExtractBits (32) ; 
UserDef inedBits = ExtractBits ( 8 ) ; 
nExtSSIndex = ExtractBits (2 ) ; 

// Unpack the num of bits to be used to read header size 
bHeaderSizeType = ExtractBits ( 1 ) ; 
if (bHeaderSizeType == 0) { 

nuBits4Header = 8; 

nuBits4ExSSFsize = 16 

} 

else{ 

nuBits4Header = 12; 
nuBits4ExSSFsize = 20; 

} 

// Unpack the substream header size 

nuExtSSHeaderSize = ExtractBits (nuBits4Header) + 1; 
nuExtSSFsize = ExtractBits (nuBits4ExSSFsize) + 1; 
bStaticFieldsPresent = ExtractBits ( 1 ) ; 
if (bStaticFieldsPresent) { 

nuRef ClockCode = ExtractBits (2 ) ; 

nuExSSFrameDurationCode = 512* (ExtractBits ( 3 ) +1 ) ; 
bTimeStampFlag = ExtractBits ( 1 ) ; 
if (bTimeStampFlag) 
{ 

nuTimeStamp = ExtractBits ( 32 ) ; 
nLSB = ExtractBits (4 ) ; 

nuTimeStamp = ( (DTS int64) (nuTimeStamp<<4 ) ) | nLSB; 

} 

nuNumAudioPresnt = ExtractBits ( 3 ) +1 ; 

nuNumAssets = ExtractBits ( 3 ) +1 ; 

for (nAuPr=0; nAuPr<nuNumAudioPresnt ; nAuPr++) 

nuActiveExSSMask [nAuPr] = ExtractBits (nExtSSIndex+1 ) ; 
for (nAuPr=0; nAuPr<nuNumAudioPresnt ; nAuPr++) { 
for (nSS=0; nSS<nExtSSIndex+l ; nSS++) { 

if (( (nuActiveExSSMask [nAuPr] >>nSS) & 0x1) == 1) 

nuActiveAssetMask [nAuPr] [nSS] = ExtractBits ( 8 ) ; 
else 

nuActiveAssetMask [nAuPr] [nSS] = 0; 



32 

8 
2 



nuBits4Header 

nuBits4ExSSFsize 
1 

2 
3 
1 



36 



3 
3 

nExtSSIndex+1 
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Extension Substream Header Structure 



Size (Bits) 



bMixMetadataEnbl = ExtractBits ( 1 ) ; 
if (bMixMetadataEnbl) { 

nuMixMetadataAdj Level = ExtractBits (2 ) ; 
nuBits4MixOutMask = (ExtractBits (2 ) +1 ) <<2 ; 
nuNumMixOutConf igs = ExtractBits (2 ) + 1; 
// Output Mixing Configuration Loop 
for (ns=0; ns<nuNumMixOutConf igs ; ns++) { 

nuMixOutChMask [ns] = ExtractBits (nBits4MixOutMask) 
nNumMixOutCh [ns] = 

NumSpkrTableLookUp (nuMixOutChMask [ns] ) 

} 

} // End of if (bMixMetadataEnbl) 
} // End of if (bStaticFieldsPresent) 
else // bStaticFieldsPresent==f alse 
{ 

nuNumAudioPresnt = 1; 
nuNumAssets = 1; 

} 

for (nAst=0; nAst< nuNumAssets; nAst++) 

nuAssetFsize [nAst] = ExtractBits (nuBits4ExSSFsize ) +1 ; 
for (nAst=0; nAst< nuNumAssets; nAst++) 

AssetDescriptor { } 
for (nAuPr=0; nAuPr<nuNumAudioPresnt ; nAuPr++) 

bBcCorePresent [nAuPr] = ExtractBits ( 1 ) ; 
for (nAuPr=0; nAuPr<nuNumAudioPresnt ; nAuPr++) { 

if (bBcCorePresent [nAuPr] ) 

nuBcCoreExtSSIndex [nAuPr] = ExtractBits (2 ) ; 
nuBcCoreAssetlndex [nAuPr] = ExtractBits ( 3 ) ; 

} 

Reserved = ExtractBits (...) ; 
ByteAlign = ExtractBits ( ... 7) ; 
nCRC16ExtSSHeader = ExtractBits ( 16 ) ; 



nuBits4MixOutMask 



nuBits4ExSSFsizex nuNumAssets 
See Asset Descriptor in Table 7-5. 
1 

2 

3 

0...7 

16 



SYNCEXTSSH (Extension Substream Sync Word) 

The extension substream has a DWORD-aligned synchronization word with the hexadecimal value of 0x64582025. 
During sync detection, the nCRC16Header checksum (see Annex B) is used to further verify that the detected sync 
pattern is not a random alias. 

UserDefinedBits (User Defined Field) 

This field is reserved and may be used by an encoder operator. This field is not included in the Metadata CRC check 
and as such can be freely altered post encoding. This field represents the beginning of the metadata block and it is 
assumed that its start location within the encoded bit-stream is at the byte boundary. 

nExtSSIndex (Extension Substream Index) 

It is possible to have up to four extension substreams, originating from different sources, e.g. disc, Ethernet, broadcast, 
hard drive, etc., that are concatenated one after another. The nExtSSIndex parameter indicates an index of each 
extension substream and helps the decoder to differentiate between the different extension substreams. Its range is from 
Oto 3. 

bHeaderSizeType (Flag Indicating Short or Long Header Size) 

If bHeaderSizeType is 0, the header size is short (up to 256 Bytes) and is expressed using 8 bits. If the 
bHeaderSizeType is 1, the header size is long (up to 4 kBytes) and is expressed using 12 bits. 

nuExtSSHeaderSize (Extension Substream Header Length) 

This is the size of the extension substream header in bytes from the SYNCEXTSSH to nCRC16ExtSSHeader inclusive. 
This value determines the location of the first component of the first audio asset of the extension substream. This 
marker also designates the end of the field nCRC16ExtSSHeader and makes it possible to quickly locate the checksum 
at byte position nuExtSSHeaderSize-2. 
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nuExtSSFsize (Number of Bytes of Extension Substream) 

This is the number of bytes in the current frame of extension substream. This value is used to traverse to the end of the 
extension substream frame. 

bStaticFieldsPresent (Per Stream Static Fields Presence Flag) 

If this field is true, it indicates that the current frame includes the metadata fields that are static over the duration of an 
encoded stream. If the bStaticFieldsPresent is false, the metadata fields that are static over the duration of an encoded 
stream are omitted from the extension substream header. In this case, decoders should set the number of assets to 1 and 
the number of audio presentations to 1 . 

nuRefClockCode (Reference Clock Code) 

This field indicates the reference clock period. The reference clock period is used for calculating a frame duration and a 
decoder presentation time of an extension substream, as shown below in Table 7-3. The reference clock period 
(RefClockPeriod) is calculated from the extracted unsigned integer 2-bit field (nuRefClockCode) using a look-up table, 
as shown in Table 7-3. 



Table 7-3: Reference Clock Period 



nuRefClockCode 


RefClockPeriod [seconds] 





1,0/32 000.0 


1 


1,0/44 100.0 


2 


1,0/48 000.0 


3 


Unused 



nuExSSFrameDurationCode (Extension Substream Frame Duration) 

This field indicates the time duration between the two consecutive occurrences of the extension substream header. This 
duration is expressed by the number of clock cycles using the reference clock indicated by the value in RefClockPeriod. 
The number of clock cycles (nuExSSFrameDurationCode) is derived from the extracted unsigned integer 3-bit field by 
multiplying its value by 512. The actual duration in seconds (ExSSFrameDuration)is calculated from the 
nuExSSFrameDurationCode and the RefClockPeriod in the following manner: 

ExSSFrameDuration = nuExSSFrameDurationCode x RefClockPeriod. 

bTimeStampFlag (Timecode presence Flag) 

This is present only if bStaticFieldsPresent is true. When bTimeStampFlag = 1, the time code field is present. 
nuTimeStamp (Timecode data) 

This is present only if bStaticFieldsPresent and bTimeStampFlag are true. The timestamp data is a 36-bit field 
composed as follows: 

_ Hours x 3600 + Minsx 60 + Sec , 

nuTimeStamp = h SampleOffset 

RefClockPeriod 



Where all of the following are true: 

• The Hours has range to 23 

• The Mins has range to 59 

• The Sec has range to 59 

• The 1/ RefClockPeriod may be 32 000, 44 100, or 48 000, as deduced from Table 7-3 

• The SampleOffset has range of to 3 1 999, 44 099, or 47 999 respectively 

The timestamp of an encoded frame (N) corresponds to that time when the first edge of the first bit of the encoded 
frame (N) shall be clocked to the decoder; i.e. it is decoder presentation time for frame (N). 
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nuNumAudioPresnt (Number of Defined Audio Presentations) 

This is present only if bStaticFieldsPresent is true. It indicates the number of audio presentations that are defined in this 
extension substream. The audio presentations are defined in terms of active extension substreams-which may have an 
index from to nExtSSIndex-and active assets within each active extension substream. From one to eight audio 
presentations may be defined. 

nuNumAssets (Number of Audio Assets in Extension Substream) 

This is present only if bStaticFieldsPresent is true. It indicates the total number of audio assets that are encoded in an 
extension substream. An extension substream may consist of up to eight audio assets. 

nuActiveExSSMask (Active Extension Substream Mask for an Audio Presentation) 

This is present only if bStaticFieldsPresent is true. The audio assets from different extension substreams may be 
combined together to create an audio presentation. The location of "1" bits in nuActiveExSSMask indicates the indices 
of the active extension substreams that are used to create an audio presentation. If more than two extension substreams 
are to be combined together, the process of combining them is performed in stages, starting from the lowest index assets 
in the lowest index extension substream. Only the current and the lower indexed extension substreams may be 
combined into an audio presentation defined in the current extension substream. 

nuActiveAssetMask (Active Audio Asset Mask) 

This is present only if bStaticFieldsPresent is true. The audio assets from all active substreams are combined together to 
create an audio presentation. The location of "1" bits in nuActiveAssetMask[nAuPr][nSS] indicates the indices of audio 
assets in the extension substream with index nSS that are used to create the audio presentation with index nAuPr. 

If more than two audio assets are to be combined together, the process of combining them is performed in stages, 
starting from the lowest index assets. 

bMixMetadataEnbl (Mixing Metadata Enable Flag) 

This is present only if bStaticFieldsPresent is true. This field is true if at least one of the audio assets present in this 
substream extension is encoded for mixing or replacement. If bMixMetadataEnbl is false, all audio assets of this 
substream extension are encoded as standalone audio presentations. 

nuMixMetadataAdjLevel (Mixing Metadata Adjustment Level) 

This is present only if bStaticFieldsPresent and bMixMetadataEnbl are true. Both the system metadata, which includes 
the listener's preferences and the bit-stream metadata, can be present in a mixing audio application. The audio asset 
mixing features that can be adjusted by system metadata are: 

• Feature 1 : The level of an audio asset, relative to the level of other audio assets involved in mixing. 

• Feature 2: The placement of an audio asset within the sound field by means of altering the mixing coefficients. 

The mixing metadata adjustment level (nuMixMetadataAdjLevel) specifies which feature(s) (1 and/or 2 above), if any, 
may be adjusted, as described in Table 7-4. nuMixMetadataAdjLevel=3 is reserved for an additional level of adjustment 
that may be specified in the future. If nuMixMetadataAdjLevel=3, all decoders shall assume that, at minimum, the 
adjustment of features 1 and 2 is allowed. 



Table 7-4: Allowed Mixing Metadata Adjustment Level 



NuMixMetadataAdjLevel 


Metadata Usage 





Use only bitstream metadata 


1 


Allow system metadata to adjust feature 1 


2 


Allow system metadata to adjust both feature 1 and feature 2 


3 


Reserved 
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nuBits4MixOutMask (Number of Bits for Mixer Output Speaker Activity Mask) 

This is present only if both bStaticFieldsPresent and bMixMetadataEnbl are true. Its value indicates how many bits are 
used for packing a mixer output speaker activity mask nuMixOutChMask. Valid values for nuBits4MixOutMask are 4, 
8, 12 and 16 and are obtained from the extracted 2-bit code using the following mapping: 

nuBits4MixOutMask = (code+l)«2. 

nuNumMixOutConfigs (Number of Mixing Configurations) 

This is present only if both bStaticFieldsPresent and bMixMetadataEnbl are true. nuNumMixOutConfigs indicates the 
number of metadata sets (each corresponding to a different mixer output speaker configuration) included in the 
metadata of each mixing audio asset. 

The content provider can transmit mixing metadata that controls the mixing for several speaker configurations that can 
be extracted from the main audio stream. For example, within a 10.2 channels main audio presentation, it is likely that 
5.1 and 7.1 downmixes are already embedded in the main audio stream. In this case, the content provider may choose to 
provide two or even three sets of mixing metadata to do one or more of the following: 

• Control mixing of the 5.1 main audio asset downmix with the supplemental audio asset. 

• Control mixing of the 7.1 main audio asset downmix with the supplemental audio asset. 

• Control mixing of the 10.2 main audio asset with the supplemental audio asset. 

A mixer in its minimal implementation may perform the mixing outlined in case 1, above, followed by the downmix to 
2 channels. However an advanced player may be able to support the mixing outlined in case 2 or 3, above. 

nuMixOutChMask (Speaker Layout Mask for Mixer Output Channels) 

This is present only if both bStaticFieldsPresent and bMixMetadataEnbl are true. This field defines the channel layout 
for mixer output channels. This information is needed to associate the mixing coefficients with appropriate channels. 
(See description of a nuSpkrActivityMask in Table 7-10.) 

nuAssetFsize (Size of Encoded Asset Data in Bytes) 

This is the number of bytes for an encoded audio asset in current frame. The audio asset descriptor metadata is not 
included in nuAssetFsize. 

In particular, the beginning of data that is included in the nuAssetFsize[0] for audio asset is at the offset of 
nuExtSSHeaderSize bytes from the beginning of the extension substream header. The beginning of encoded data for 
audio asset 1 is at the offset of nuExtSSHeaderSize+nuAssetFsize[0] bytes, etc. 

Notice that when an audio asset's data is split between the core substream and the extension substream, the 
nuAssetFsize does not include the core data. 

AssetDescriptor (Audio Asset Descriptor) 

This is the audio asset descriptor is present for each encoded audio asset. Each audio asset may represent different audio 
asset types, such as Music and Effects, Dialog and Commentary. Each audio asset may be encoded with a different 
number of channels, a different sampling frequency and a different bit-width. The audio asset descriptor is a block of 
metadata that describes each of these parameters. Details are described in clause 7.4.2. 
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bBcCorePresent (Backward Compatible Core Present) 

By default, the core substream is sent over SPDIF to guarantee backward compatibility. However, in the case of a 
stream with multiple presentations, an alternate audio presentation may have a backward compatible core component 
that resides in an extension substream. When the bBcCorePresent flag is true for a particular audio presentation, the 
backward compatible core is present in the stream. In this case, the field nuBcCoreExtSSIndex and the field 
nuBcCoreAssetlndex indicate, respectively, the index of an extension substream and the index of an audio asset that 
contains the backward compatible core. By default the core sub-stream data, if present, has the 

nuBcCoreExtSSIndex = and the nuBcCoreAssetlndex = 0. Based on the active presentation, the internal decoder shall 
determine which backward compatible core shall be parsed out of the DTS-HD stream and delivered via SPDIF to the 
backward compatible decoder. If the backward compatible core has been extracted from the extension sub-stream then 
prior to sending it via SPDIF the internal decoder shall replace its existing sync word by the value of 0x7FFE8001 
(backward compatible sync word). The amount of data that is to be transmitted via SPDIF is determined from: 

• the core frame header parameter FSIZE, for the case when the backward compatible core is obtained from the 
core sub-stream; 

• the asset descriptor parameter nuExSSCoreFsize parameter, for the case when the backward compatible core is 
obtained from the extension sub-stream. 

nuBcCoreExtSSIndex (Backward Compatible Core Extension Substream Index) 

This field is present only when bBcCorePresent is true. This field indicates the index of the extension substream that 
contains an asset with a backward compatible core for a specific audio presentation. 

nuBcCoreAssetlndex (Backward Compatible Core Asset Index) 

This field is present only when bBcCorePresent is true. This field indicates the index of an audio asset that contains the 
backward compatible core for a specific audio presentation. 

Reserved (Reserved) 

This field is reserved for additional extension substream header information. The decoder shall assume that this field is 
present and of unspecified duration. Therefore in order to continue unpacking the stream, the decoder shall skip over 
this field using the extension substream header start pointer and the extension substream header size 
nuExtSSHeaderSize. 

nCRC16ExtSSHeader (CRC16 of Extension Substream Header) 

nCRC16ExtSSHeader is the CRC16 of the entire extension substream header from positions nExtSSIndex to ByteAlign, 
inclusive. See Annex B for details of the CRC algorithm used. 

7.4.2 Audio Asset Descriptor 

The audio asset descriptor is present for each encoded audio asset. The audio asset descriptor is a block of metadata that 
describes each of these parameters. Its syntax is provided in Table 7-5, Table 7-6 and Table 7-7. 

Note that a new pseudo-function, 'unsigned int CountBitsSet_to_l (unsigned int nuWord)' is introduced, first appearing 
in Table 7-5. The function calculates the number of bits that are set to "1" in the parameter nuWord. 



Table 7-5: Audio Asset Descriptor Syntax: 
Size, Index and Per Stream Static Metadata 



Audio asset Descriptor Syntax 


Size (Bits) 


nuAssetDescriptFsize = ExtractBits ( 9 ) +1 ; 


9 


nuAssetlndex = ExtractBits ( 3 ) ; 


3 


if (bStaticFieldsPresent) { 


1 


bAssetTypeDescrPresent = ExtractBits ( 1 ) ; 


if (bAssetTypeDescrPresent) 


4 


nuAssetTypeDescriptor = ExtractBits (4 ) ; 




bLanguageDescrPresent = ExtractBits ( 1 ) ; 


1 


if (bLanguageDescrPresent) 


24 


LanguageDescriptor = ExtractBits (24 ) ; 


blnfoTextPresent = ExtractBits ( 1 ) ; 


1 


if (blnfoTextPresent) 


10 


nulnf oTextByteSize = ExtractBits ( 10 ) +1 ; 
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Audio asset Descriptor Syntax 



Size (Bits) 



if (blnfoTextPresent) 

Inf oTextString = ExtractBits (nulnf oTextByteSize*8 ) ; 
nuBitResolution = ExtractBits { 5 ) + 1; 
nuMaxSampleRate = ExtractBits (4 ) 
nuTotalNumChs = ExtractBits ( 8 ) +1 ; 
bOne20neMapChannels2Speakers = ExtractBits ( 1 ) ; 
if (bOne20neMapChannels2Speakers) { 

if (nuTotalNumChs>2 ) 

bEmbeddedStereoFlag = ExtractBits ( 1 ) ; 

else 

bEmbeddedStereoFlag = ; 
if (nuTotalNumChs>6 ) 

bEmbeddedSixChFlag = ExtractBits ( 1 ) ; 
else 

bEmbeddedSixChFlag = ; 
bSpkrMaskEnabled = ExtractBits ( 1 ) ; 
if (bSpkrMaskEnabled) 

nuNumBits4SAMask = (ExtractBits (2 ) +1 ) <<2 ; 
if (bSpkrMaskEnabled) 

nuSpkrActivityMask = ExtractBits (nuNumBits4SAMask) ; 
nuNumSpkrRemapSets = ExtractBits ( 3 ) ; 
for (ns=0; ns<nuNumSpkrRemapSets ; ns++) 

nuStndrSpkrLayoutMask [ns] = ExtractBits (nuNumBits4SAMask) ; 
for (ns=0 ; ns<nuNumSpkrRemapSets ; ns++) { 

nuNumSpeakers = NumSpkrTableLookUp (nuStndrSpkrLayoutMask [ns] ) ; 

nuNumDecCh4Remap [ns] = ExtractBits ( 5 ) +1 ; 

for (nCh=0; nCh<nuNumSpeakers ; nCh++) { // Output channel loop 
nuRemapDecChMask [ns] [nCh] = ExtractBits (nuNumDecCh4Remap [ns] 
nCoef = CountBitsSet_to_l (nuRemapDecChMask [ns] [nCh] ) ; 
for (nc=0; nc<nCoef; nc++) 

nuSpkrRemapCodes [ns] [nCh] [nc] = ExtractBits ( 5 ) ; 
} // End output channel loop 
} // End nuNumSpkrRemapSets loop 
} // End of if (bOne20neMapChannels2Speakers) 
else{ // No speaker feed case 
bEmbeddedStereoFlag = false ; 
bEmbeddedSixChFlag = false; 
nuRepresentationType = ExtractBits (3 ) ; 

} 

// End of if (bStaticFieldsPresent) 



nulnfoTextByteSizex8 

5 
4 
8 
1 



1 

2 

nuNumBits4SAMask 
3 

nuNumBits4SAMask 



nuNumDecCh4Remap[ns] 
5 



Table 7-6: Audio Asset Descriptor Syntax: 
Dynamic Metadata - DRC, DNC and Mixing Metadata 



Audio asset Descriptor Syntax 



Size (Bits) 



bDRCCoef Present = ExtractBits ( 1 ) ; 
if (bDRCCoef Present) 

nuDRCCode = ExtractBits ( 8 ) ; 
bDialNormPresent = ExtractBits ( 1 ) ; 
if (bDialNormPresent) 

nuDialNormCode = ExtractBits ( 5 ) ; 
if (bDRCCoef Present && bEmbeddedStereoFlag) 

nuDRC2ChDmixCode = ExtractBits ( 8 ) ; 
if (bMixMetadataEnbl) 

bMixMetadataPresent = ExtractBits ( 1 ) ; 
else 

bMixMetadataPresent = false ; 
if (bMixMetadataPresent) { 

bExternalMixFlag = ExtractBits ( 1 ) ; 
nuPostMixGainAdj Code = ExtractBits ( 6 ) ; 
nuControlMixerDRC = ExtractBits (2 ) ; 
if (nuControlMixerDRC <3) 

nuLimit4EmbeddedDRC = ExtractBits ( 3 ) ; 
if (nuControlMixerDRC ==3) 

nuCustomDRCCode = ExtractBits ( 8 ) ; 
bEnblPerChMainAudioScale = ExtractBits ( 1 ) ; 
for (ns=0; ns<nuNumMixOutConf igs ; ns++) { 
if (bEnblPerChMainAudioScale) { 

for (nCh=0; nCh<nNumMixOutCh [ns] ; nCh++) 

nuMainAudioScaleCode [ns] [nCh] = ExtractBits ( 6 ) ; 

} 

else 
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Audio asset Descriptor Syntax 


Size (Bits) 


nuMainAudioScaleCode [ns] [0] = ExtractBits { 6 ) ; 

} 

nEmDM = 1 ; 

nDecCh[0] = nuTotalNumChs ; 
if (bEmbeddedSixChFlag) { 

nDecCh [nEmDM] = 6 ; 

nEmDM = nEmDM + 1; 

} 

if (bEmbeddedStereoFlag) { 
nDecCh [nEmDM] = 2 ; 
nEmDM = nEmDM + 1; 

} 

for (ns=0; ns<nuNumMixOutConf igs ; ns++) { //Configuration Loop 
for { nE=0; nE<nEmDM; nE++) { // Embedded downmix loop 

for (nCh=0; nCh<nDecCh [nE] ; nCh++) { //Supplemental Channel Loop 
nuMixMapMask [ns] [nE] [nCh] = ExtractBits (nNumMixOutCh [ns] ) ; 
nuNumMixCoef s [ns] [nE] [nCh] = 
CountBitsSet to 1 (nuMixMapMask [ns] [nE] [nCh] ) ; 

for <nC=0; nC<nuNumMixCoef s [ns] [nE] [nCh] ; nC+ + ) 

nuMixCoef f s [ns] [nE] [nCh] [nC] = ExtractBits ( 6 ) ; 
} // End supplemental channel loop 
} // End of Embedded downmix loop 
} // End configuration loop 
} // End if (bMixMetadataPresent ) 


nNumMixOutCh[ns] 
6 



Table 7-7: Audio Asset Descriptor Syntax: Decoder Navigation Data 



Audio asset Descriptor Syntax 



Size (Bits) 



nuCodingMode = ExtractBits (2 ) ; 
switch (nuCodingMode) { 
case : 

nuCoreExtensionMask = ExtractBits ( 12 ) ; 

If (nuCoreExtensionMask & DTS_EXSUBSTREAM_CORE) 

nuExSSCoreFsize = ExtractBits ( 14 ) +1 ; 

bExSSCoreSyncPresent = ExtractBits ( 1 ) ; 

if (bExSSCoreSyncPresent) 

nuExSSCoreSyncDistlnFrames = 1<< (ExtractBits (2 )) ; 

} 

If (nuCoreExtensionMask & DTS_EXSUBSTREAM_XBR) 

nuExSSXBRFsize = ExtractBits ( 14 ) +1 ; 
If (nuCoreExtensionMask & DTS_EXSUBSTREAM_XXCH) 

nuExSSXXCHFsize = ExtractBits ( 14 ) +1 ; 
If (nuCoreExtensionMask & DTS_EXSUBSTREAM_X96 ) 

nuExSSX96Fsize = ExtractBits ( 12 ) +1 ; 
If (nuCoreExtensionMask & DTS_EXSUBSTREAM_LBR) { 

nuExSSLBRFsize = ExtractBits ( 14 ) +1 ; 

bExSSLBRSyncPresent = ExtractBits ( 1 ) ; 

if (bExSSLBRSyncPresent) 

nuExSSLBRSyncDistlnFrames = 1<< (ExtractBits (2 )) ; 

} 

If (nuCoreExtensionMask & DTS_EXSUBSTREAM_XLL ) { 
nuExSSXLLFsize = ExtractBits (nuBits4ExSSFsize) +1 ; 
bExSSXLLSyncPresent = ExtractBits ( 1 ) ; 
if (bExSSXLLSyncPresent) { 

nuPeakBRCntrlBuf f SzkB = ExtractBits (4 ) <<4 ; 

nuBitsInitDecDly = ExtractBits ( 5 ) +1 ; 

nuInitLLDecDlyFrames=ExtractBits (nuBitsInitDecDly) ; 

nuExSSXLLSyncOf f set=ExtractBits (nuBits4ExSSFsize ) ; 



If (nuCoreExtensionMask & RESERVED_1) 

Ignore = ExtractBits ( 16 ) ; 
If (nuCoreExtensionMask & RESERVED_2) 

Ignore = ExtractBits ( 16 ) ; 
Break ; 
case 1 : 

nuExSSXLLFsize = ExtractBits (nuBits4ExSSFsize) +1 ; 
bExSSXLLSyncPresent = ExtractBits ( 1 ) ; 
If (bExSSXLLSyncPresent) { 

nuPeakBRCntrlBuf f SzkB = ExtractBits (4 ) <<4 ; 

NuBitsInitDecDly = ExtractBits ( 5 ) +1 ; 



12 

14 
1 

2 

14 
14 
12 

14 
1 



nuBits4ExSSFsize 
1 

4 
5 

nuBitsInitDecDly 
nuBits4ExSSFsize 



16 
16 



nuBits4ExSSFsize 
1 

4 
5 
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Audio asset Descriptor Syntax 


Size (Bits) 


nuInitLLDecDlyFrames = ExtractBits (nuBitsInitDecDly ) ; 
nuExSSXLLSyncOf f set=ExtractBits (nuBits4ExSSFsize ) ; 


nuBitsInitDecDly 
nuBits4ExSSFsize 


} 

break; 
case 2 : 

nuExSSLBRFsize = ExtractBits ( 14 ) +1 ; 
bExSSLBRSyncPresent = ExtractBits ( 1 ) ; 
if (bExSSLBRSyncPresent) 

nuExSSLBRSyncDistlnFrames = 1<< (ExtractBits (2 )) ; 
Break ; 
case 3 : 

nuExSSAuxFsize = ExtractBits ( 14 ) +1 ; 
nuAuxCodecID = ExtractBits ( 8 ) ; 
bExSSAuxSyncPresent = ExtractBits ( 1 ) ; 
if (bExSSAuxSyncPresent) 

nuExSSAuxSyncDistlnFrames = ExtractBits ( 3 ) +1 ; 
Break ; 
default : 
break; 


14 
1 

2 

14 
8 
1 

3 


} 

if ( ( (nuCodingMode==0) && (nuCoreExtensionMask & DTS_EXSUBSTREAM_XLL) ) | 
(nuCodingMode==l) ) { 

nuDTSHDStreamID = ExtractBits ( 3 ) ; 


3 


} 

if (bOne20neMapChannels2Speakers ==true && bMixMetadataEnbl ==true && 
bMixMetadataPresent==f alse) 

bOnetoOneMixingFlag = ExtractBits ( 1 ) ; 
if (bOnetoOneMixingFlag) { 

bEnblPerChMainAudioScale = ExtractBits ( 1 ) ; 
for (ns=0; ns<nuNumMixOutConf igs ; ns++) { 
if (bEnblPerChMainAudioScale) { 

for (nCh=0; nCh<nNumMixOutCh [ns] ; nCh++) 

nuMainAudioScaleCode [ns] [nCh] = ExtractBits ( 6 ) ; 

} 


1 
1 

6 


else 

nuMainAudioScaleCode [ns] [0] = ExtractBits ( 6 ) ; 




} 

} // End of bOnetoOneMixingFlag==true condition 
bDecodeAssetlnSecondaryDecoder = ExtractBits ( 1 ) ; 

bDRCMetadataRev2Present = (ExtractBits (1) == 1) ? TRUE : FALSE; 

{ 


1 
1 


DRCversion Rev2 = ExtractBits (4 ) ; 


4 


// one DRC value for each block of 256 samples 
nRev2 DRCs = nuExSSFrameDurationCode / 256; 
// assumes DRCversion_Rev2 == 1: 

for (subSubFrame=0 ; subSubFrame < nRev2 DRCs; subSubFrame++ ) 

{ 




DRCCoef f_Rev2 [subSubFrame] = dts_dynrng_to_db (ExtractBits ( 8 )) ; 

} 


8 * nRev2_DRCs 


} 

Reserved = ExtractBits (...) ; 
ZeroPadForFsize = ExtractBits ( ... 7) ; 


0...7 



7.4.2.1 Static Metadata 

nuAssetDescriptFsize (Size of Audio Asset Descriptor in Bytes) 

This field indicates the size of the audio asset descriptor in bytes, from nuAssetDescriptFsize to ZeroPadForFsize 
inclusive. If there are multiple audio assets, nuAssetDescriptFsize is used to navigate to the location of the next audio 
asset descriptor. 
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nuAssetlndex (Audio Asset Identifier) 

This parameter represents the unique audio asset index. Its range is from to nuNumAssets - 1. If the current asset is to 
be mixed into some lower index asset and the current extension substream index is zero (nExtSSIndex = 0), the current 
asset index shall be greater than zero (nuAssetlndex > 0), even if it is the first asset to be encoded in the extension 
substream 0. This condition guarantees correct decoder behaviour when the primary audio does not contain a DTS-HD 
extension substream. This can be either DTS core substream only or non-DTS primary audio. It is assumed that primary 
audio is carried as the audio asset in the extension substream 0. 

bAssetTypeDescrPresent (Asset Type Descriptor Presence) 

This field is present only if bStaticFieldsPresent is true. When set, it indicates that the audio asset type descriptor field 
follows. 

nuAssetTypeDescriptor (Asset Type Descriptor) 

This field is present only if bStaticFieldsPresent and bAssetTypeDescrPresent are true. This field represents the index 
into the audio asset type descriptor lookup table shown in Table 7-8. The value of nuAssetTypeDescriptor = 15 is 
reserved for future expanded audio asset types. Decoders that do not have an expanded asset type table definition shall 
treat this case as an "Unknown" audio asset type. 



Table 7-8: Audio Asset Type Descriptor Table 



nuAssetTypeDescriptor 


Comment 





Music 


1 


Effects 


2 


Dialog 


3 


Commentary 


4 


Visually impaired 


5 


Hearing impaired 


6 


Isolated music object (group of instruments/voices) 


7 


Music and Effects 


8 


Dialog and Commentary 


9 


Effects and Commentary 


10 


Isolated music object and Commentary 


11 


Isolated music object and Effects 


12 


Karaoke 


13 


Music, Effects and Dialog 


14 


Complete Audio Presentation 


15 


RESERVED 



bLanguageDescrPresent (Language Descriptor Presence) 

This field is present only if bStaticFieldsPresent is true. When set, it indicates that a language descriptor field follows. 
LanguageDescriptor (Language Descriptor) 

This field is present only if bStaticFieldsPresent and bLanguageDescrPresent are true. It denotes that a language code is 
associated with the audio asset. 

The code represents a 3-character language identifier according to the ISO 639-2/B [i.5]. Each character is coded using 
8 bits according to ISO/IEC 8859-1 [i.6] (ISO Latin- 1) and inserted in order into the 24-bit field 
(1 st character = 8 MSBs ... 3 rd character = 8 LSBs). 

blnfoTextPresent (Additional Textual Information Presence) 

This field is present only if bStaticFieldsPresent is true. When set, it indicates that additional textual information fields 
are present. 
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nuInfoTextByteSize (Byte Size of Additional Text Info) 

This field is present only if bStaticFieldsPresent and blnfoTextPresent are true. The value indicates the size in bytes of 
the text information field. 

InfoTextString (Additional Textual Information String) 

This field is present only if bStaticFieldsPresent and blnfoTextPresent are true. This character string may be used as a 
textual description of the audio asset. Each character in the string is encoded with 1-byte using the ISO Latin- 1 alphabet 
(ISO/IEC 8859-1 [i.6]). 

nuBitResolution (PCM Bit Resolution) 

This field is present only if bStaticFieldsPresent is true. It expresses the original resolution of the PCM audio source as 
output by an A/D converter. If the sample resolution differs between the channels, nuBitResolution indicates the 
maximum value among all channels in the audio asset. 

nuMaxSampleRate 

This field is present only if bStaticFieldsPresent is true. This field contains the index to the sample frequency, as 
defined in Table 7-9. It allows the system to quickly ascertain the maximum sample frequency among all channels 
present in the audio asset. 



Table 7-9: Source Sample Rate Table 



nuMaxSampleRate 


Sample Frequency (Hz) 





8 000 


1 


16 000 


2 


32 000 


3 


64 000 


4 


128 000 


5 


22 050 


6 


44 100 


7 


88 200 


8 


176 400 


9 


352 800 


10 


12 000 


11 


24 000 


12 


48 000 


13 


96 000 


14 


192 000 


15 


384 000 



nuTotalNumChs 

This field is present only if bStaticFieldsPresent is true. It represents the total number of channels that might be decoded 
individually. This field can range from to 255 and the total number of channels can range from 1 to 256. 

bOne20neMapChannels2Speakers 

This field is present only if bStaticFieldsPresent is true. If true, this flag indicates that each encoded channel within this 
audio asset represents a signal feed to a corresponding loudspeaker on the decode side. If this flag is false, it indicates 
that channels within this audio asset carry the signals that describe the sound field, but are not actual loudspeaker feeds. 
The actual loudspeaker feeds shall be derived on the decode side using the stream-embedded coefficients and possible 
user -provided adjustment factors. 

bEmbeddedStereoFlag 

This is present only when bStaticFieldsPresent and bOne20neMapChannels2Speakers are true and the total number of 
channels is greater than two (nuTotalNumChs > 2). When this flag is true, an embedded stereo downmix, generated 
during encoding, exists in the asset as a separate channel set in the front left and right channel pair (LR). When this is 
true, a decoder attempting to decode only two channels shall extract and decode the LR pair that already contains the 
stereo downmix and ignore all other channels. If the bEmbeddedStereoFlag = false, no embedded downmix is generated 
by the encoder. 
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bEmbeddedSixChFlag 

This is present only when bStaticFieldsPresent and bOne20neMapChannels2Speakers are true and if the total number 
of channels is greater than six (nuTotalNumChs > 6). When bEmbeddedSixChFlag = true, this indicates to the decoder 
that on the encode side, all relevant channels have been mixed: 

• into the standard 5.1 layout L, R, C, Ls, Rs, LFE; or 

• into the L, R, C, Lss, Rss, LFE layout when the 7.1 layout is in the L, R, C, Lss, Rss, LFE, Lsr and Rsr 
configuration. 

In this case, the decoder that attempts to decode only six channels may extract and decode the 5.1 channel downmix and 
ignore all other channels. If the bEmbeddedSixChFlag = false, the encoder did not perform the downmix to 
six channels' operation. 

The bEmbeddedSixChFlag field is part of the overview information and may be used by the system for quick access to 
the format description. All decoders shall extract and use this information during the extraction of the remaining 
extension substream header fields. After the extension substream header extraction, all decoders shall ignore this 
information, since more detailed information is available within the channel set sub-headers. 

bSpkrMaskEnabled 

This is present only when bStaticFieldsPresent and bOne20neMapChannels2Speakers are true. When 
bSpkrMaskEnabled is true, all loudspeaker locations are specified using predefined locations as listed in Table 7-10. 

nuNumBits4SAMask (Number of Bits for Speaker Activity Mask) 

This is present only if bStaticFieldsPresent, bOne20neMapChannels2Speakers and bSpkrMaskEnabled are true. The 
value indicates how many bits are used for packing the loudspeaker activity mask nuSpkrActivityMask and the standard 
speaker layout mask nuStndrSpkrLayoutMask. Valid values for nuNumBits4SAMask are 4, 8, 12 and 16 and are 
obtained from the extracted 2-bit code using the following mapping: 

nuNumBits4SAMask = (code + 1) « 2 

nuSpkrActivityMask (Loudspeaker Activity Mask) 

This is present only if bStaticFieldsPresent, bOne20neMapChannels2Speakers and bSpkrMaskEnabled are true. The 
nuSpkrActivityMask indicates which of the pre-defined loudspeaker positions apply to the audio channels encoded in 
DTS-HD stream. Each encoded channel or channel pair, depending on the corresponding speaker position(s), sets the 
appropriate bit in a loudspeaker activity mask. Predetermined loudspeaker positions are described in Table 7-10. For 
example, nuSpkrActivityMask = OxF indicates activity of C, L, R, L s , R s and LFEj loudspeakers. 

When the decoder supports up to a MaxSpeakers number of speakers and when that number is less than the number 
indicated by the total number of channels, i.e. MaxSpeakers < nuTotalNumChs, the decoder decodes channel sets, in 
increasing channel set index order, such that the total number of decoded channels is less or equal to MaxSpeakers 
channels. A channel set with index k, which brings the cumulative number of decoded channels, in all channel sets from 
to k, to be greater than MaxSpeakers is ignored and not decoded. All channel sets with index >k are also ignored and 
not decoded. In other words only channel sets with index from to k-1 are decoded. 
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Table 7-10: Loudspeaker Bit Masks for nuSpkr Activity Mask, 
nuStndrSpkrLayoutMask, nuMixOutChMask 



Nntatinn 




Rit Mack 

LJ 1 1 IVIClOlV 


liUIIIUCI Ul w 1 IC1I II ICIo 


L> 


CGntrG in front of listGnor 


uxuuin 


\ 


1 D 

Lri 


LGTl/nlyni In TrOnT 


UXUUIM 


o 

d. 


1 R 


Luii/riiyru buiiuunu on oiuc i r ■ icdi 


UXUUU^f 


O 

c. 


1 FF, 


LUW liyL]UUllL.y fcillfciOlb bUUWUUlcl 


uxuuuo 


H 
1 




oemre burrounci in rear 


UXUU I u 


I 


1 D 


Left/Right height in front 


AvflAOn 
UXUU<iU 


d. 


1 „.R 
i-srrtsr 


1 pft/Rinht qi irrm inrl in roar 

l_CI L/ niy 1 1 L oUM UUMU III Iddl 


UAUUtU 


2 


c h 


Centre Height in front 


0x0080 


1 


Oh 


Over the listener's head 


0x0100 


1 


L c Rc 


Between left/right and centre in front 


0x0200 


2 


L-wRw 


Left/Right on side in front 


0x0400 


2 


LssRss 


Left/Right surround on side 


0x0800 


2 


LFE 2 


Second low frequency effects subwoofer 


0x1000 


1 


LhsRhs 


Left/Right height on side 


0x2000 


2 


Chr 


Centre height in rear 


0x4000 


1 


LhrRhr 


Left/Right height in rear 


0x8000 


2 



nuNumSpkrRemapSets (Number of Speaker Remapping Sets) 

This is present only if bStaticFieldsPresent and bOne20neMapChannels2Speakers are true. This parameter indicates the 
number of standard loudspeaker layouts for which remapping coefficients are provided in the stream. 

nuStndrSpkrLayoutMask (Standard Loudspeaker Layout Mask) 

This is present only if bStaticFieldsPresent and bOne20neMapChannels2Speakers are true and if the number of speaker 
remapping sets is greater than zero, (nuNumSpkrRemapSets > 0). nuStndrSpkrLayoutMask indicates the standard 
speaker layout for which the remapping coefficients are provided. Coding of this field follows the same format as used 
for nuSpkrActivityMask , described in Table 7-10. 

nuNumDecCh4Remap (Number of Channels to be Decoded for Speaker Remapping) 

This is present only if bStaticFieldsPresent and bOne20neMapChannels2Speakers are true and if the number of speaker 
remapping sets is greater than zero (nuNumSpkrRemapSets > 0). For each loudspeaker remapping coefficient set that 
maps to a particular standard loudspeaker configuration, there are a specific number of encoded channels involved in 
the remapping. This is the number of channels that needs to be decoded in order to produce the speaker feeds for the 
standard loudspeaker configurations. 

nuRemapDecChMask (Decoded Channels to Output Speaker Mapping Mask) 

This is present only if bStaticFieldsPresent and bOne20neMapChannels2Speakers are true and if the number of speaker 
remapping sets is greater than zero (nuNumSpkrRemapSets > 0). Usually, a sub-set of all decoded channels is involved 
in remapping to a particular output loudspeaker and the bit-mask that defines this sub-set is transmitted in the parameter 
nuRemapDecChMask. Location of bits set to " 1 " indicates that the corresponding decoded channel is involved in 
mapping to the particular loudspeaker. The ordering of the channels is first according to the channel set index and next 
according to the channel mask nuSpkrActivityMask within each channel set. The location of a least significant bit 
corresponds to the first decoded channel. 

nuSpkrRemapCodes (Loudspeaker Remapping Codes) 

This is present only if bStaticFieldsPresent and bOne20neMapChannels2Speakers are true and if the number of speaker 
remapping sets is greater than zero (nuNumSpkrRemapSets > 0). It indicates the mapping from the encoded channels 
that are arranged in some non-standard loudspeaker configuration to the specified "standard" loudspeaker configuration. 
For each output speaker location, there is a sub-set of encoded channels that are contributing to the output speaker feed. 
This sub-set is described by nuRemapDecChMask and the number of coefficients per each output loudspeaker is equal 
to the number of bits that are set to " 1 " in each corresponding nuRemapDecChMask. The transmitted loudspeaker 
remapping coefficients are obtained from the extracted 5-bit codes according to the procedure that is described in clause 
C.3. The decoded channels that have their corresponding bits in nuRemapDecChMask set to "0" are not mapped to a 
specific speaker; in other words, their mapping coefficient is assumed to be -co on a log scale or on a linear scale. 
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nuRepresentationType (Representation Type) 

This is present only if bStaticFieldsPresent is true and bOne20neMapChannels2Speakers is false. It describes the type 
of representation according to Table 7-11. This information may be useful in some post-processing tasks. The decoder 
shall export this information to post-processing functions. 



Table 7-1 1 : Representation Type 



nuRepresentationType 


Description 


ObOOO 


Audio Asset for Mixing/Replacement 


0b001 


Not Applicable 


0b010 


Lt/Rt Encoded for matrix surround decoding implies nuTotalNumChs=2 


0b011 


Lh/Rh Audio processed for headphone playback implies nuTotalNumChs=2 


0b100 


Not Applicable 


0b101 - 0b1 1 1 


Reserved 



7.4.2.2 Dynamic Metadata 

bDRCCoefPresent (Dynamic Range Coefficient Presence Flag) 

When bDRCCoefPresent is true, the Dynamic Range Coefficient(s) (DRC) for a current audio asset is present in the 
stream. 

nuDRCCode (Code for Dynamic Range Coefficient) 

This field is present only if bDRCCoefPresent is true. Each 8-bit code is an unsigned integer and it indicates a 
logarithmic gain value in a range from -31,75 dB to 32 dB in steps of 0,25 dB. This format is identical to the DRC 
coefficients used for the DTS core. 

The calculation of the logarithmic gain value (DRC_dB) from the extracted code (DRC_Code) is: 

DRC_dB = -32 + (DRC_Code + 1) x 0,25 

To perform the dynamic range compression, the decoder multiplies the decoded audio samples by a linear coefficient 
obtained from the logarithmic gain value (DRC_dB). All channels of an audio asset are scaled by the same value. 

nuDRC2ChDmixCode (DRC for Stereo Downmix) 

This field is present only if bDRCCoefPresent and bEmbeddedStereoFlag are true. The calculation of the logarithmic 
gain value (DRC_dB) from the extracted code (nuDRC2ChDmixCode) is as follows: 

DRC_dB = -32 + (nuDRC2ChDmixCode + 1) x 0,25 

Dynamic range compression applied to the two-channel downmix may differ considerably from the dynamic range 
compression applied to multi-channel mixes. Consequently, when a two-channel downmix has been embedded on the 
encoder side, a separate dynamic range coefficient is transmitted for the two-channel downmix. The coding of this field 
is identical to the coding used for the nuDRCCode. When the decoder outputs a two-channel downmix, the decoder 
shall use the nuDRC2ChDmixCode for the dynamic range compression. 

bDialNormPresent (Dialog Normalization Presence Flag) 

When bDialNormPresent is true, the dialog normalization parameter for a current audio asset is present in the stream. 
nuDialNormCode (Dialog Normalization Code) 

This field is present only if the bDialNormPresent is true. nuDialNormCode indicates the dialog normalization gain and 
is transmitted as a 5-bit unsigned integer in the range from to 31. The dialog normalization gain (DNG), in dB, is 
specified by the encoder operator and is used to directly scale the decoder output samples in all channels of an audio 
asset. The dialog normalization gain (DNG) is obtained from the dialog normalization code (nuDialNormCode) simply 
by inverting the sign, i.e. 

DNG = -nuDialNormCode 
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The allowed range for DNG is from dB to -31 dB in steps of -1 dB. In case of lossless decoding, the value of 
DNG = dB shall enable the lossless reconstruction. When nuDialNormCode is updated, the transition from the 
previous value to the new value is described in clause C.6. 

bMixMetadataPresent (Mixing Metadata Presence Flag) 

This field is present only if bMixMetadataEnbl is true. When true, the mixing metadata for this audio asset is present in 
the stream. When bMixMetadataPresent is false, there is no mixing metadata after the bMixMetadataPresent field and 
the metadata parameters maintain their values from the previous frame. This field allows the rate of metadata 
transmission to be controlled. When bMixMetadataEnbl is false, the value for bMixMetadataPresent shall be false. 

bExternalMixFlag (External Mixing Flag) 

This is present only if bMixMetadataPresent is true. Secondary audio assets in some applications may be exported for 
mixing outside of the DTS-HD decoder. This field indicates whether the asset is exported (flag is true) or the asset is 
mixed within the DTS-HD decoder (flag is false). The decoder uses this flag to determine which assets shall be 
exported. 

nuPostMixGainAdjCode (Post Mixing/Replacement Gain Adjustment ) 

This field is present only if bMixMetadataPresent is true. When mixing multiple assets, the overall mixture is 
additionally scaled by the post-mix adjustment coefficient obtained from the nuPostMixGainAdjCode that is transmitted 
in the highest-indexed active asset of the desired audio presentation. All nuPostMixGainAdjCode values that are 
transmitted in lower-indexed active assets of the desired audio presentation are extracted and ignored. This scaling 
factor is applied to all speaker outputs after combining the last audio asset with the mixture of all lower-indexed active 
assets. The scale factor is in the range between -14,5 dB and +15 dB in steps of 0,5 dB. It is coded as a 6-bit unsigned 
index with a valid range from 1 to 60. The interpretation of the index and the calculation of the linear scale factor are 
described in clause C.4. When nuPostMixGainAdjCode is updated, the transition from the previous value to the new 
value is described in clause C.6. 

nuControlMixerDRC (Dynamic Range Compression Prior to Mixing) 

This is present only if bMixMetadataPresent is true. If bMixMetadataPresent=false the nuControlMixerDRC shall be set 
to its default value of 2. 

This is a 2-bit field that represents an index described in Table 7-12. 



Table 7-12: Dynamic Range Compression Prior to Mixing 



nuControlMixerDRC 


Description 





Prior to mixing, perform dynamic range compression on the mixture of all lower 
indexed active assets, using corresponding DRC code limited by the value 
indicated by nuLimit4EmbeddedDRC 


1 


Prior to mixing, perform dynamic range compression on the current audio asset 
only using associated DRC code limited by the value indicated by 
nuLimit4EmbeddedDRC 


2 


Prior to mixing, perform dynamic range compression on both the mixture of all 
lower indexed active assets and the current audio asset, each using its own DRC 
code limited by the value indicated by nuLimit4EmbeddedDRC 


3 


Prior to mixing, perform dynamic range compression on both the mixture of all 
lower indexed active assets and the current audio asset, each using the DRC 
code transmitted in the field nuCustomDRCCode 



nuLimit4EmbeddedDRC (Limit for Mixing Dynamic Range Compression) 

This field is present only if bMixMetadataPresent is true and nuControlMixerDRC < 3. It is coded as a 3-bit value 
representing an index in Table 7-13. If parameter nuLimit4EmbeddedDRC is not transmitted in a stream, its value shall 
be set to 7. 
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Table 7-13: Limit for Dynamic Range Compression Prior to Mixing 



nuLimit4EmbeddedDRC 


Limit for DRC code in % (ScaleDRC x 100) 





(disable prior to mixing DRC) 


1 


15 


2 


30 


3 


45 


4 


60 


5 


75 


6 


90 


7 


100 



By specifying a value for nuLimit4EmbeddedDRC a content provider may limit the dynamic range compression that is 
applied to audio assets prior to the mixing. In particular a listener may specify the desired amount (DesDRC) of 
dynamic range compression in the range of % to 100 %. The value (ScaleDRC x 100) coded in the parameter 
nuLimit4EmbeddedDRC represents the upper limit for the original listener requested amount of dynamic range 
compression (DesDRC). Therefore the decoder shall adjust its logarithmic gain value (DRC_dB corresponding to a 
DRCCode extracted from a stream) as follows: 

if <DesDRC>ScaleDRC*100) 

DRC_dB = DRC_dB* ScaleDRC 

else 

DRC_dB = DRC_dB* (DesDRC/100) . 

nuCustomDRCCode (Custom Code for Mixing Dynamic Range Coefficient) 

This field is present only if bDRCCoefPresent is true and nuControlMixerDRC = 3. When mixing audio assets, if 
custom dynamic range compression is required, an encoder operator may provide a custom DRC code that is 
transmitted in nuCustomDRCCode field. 

Each code is an 8-bit unsigned integer and indicates a logarithmic gain value in a range from -31,75 dB to 32 dB in 
steps of 0,25 dB. 

The calculation of the logarithmic gain value (DRC_dB) from the extracted code (nuCustomDRCCode) is as follows: 

DRC_dB = -32 + (nuCustomDRCCode + l)x0,25 

To perform dynamic range compression, the decoder multiplies the decoded audio samples by a linear coefficient 
obtained from the logarithmic gain value (DRC_dB). All channels of the mixture of all lower indexed active assets and 
the current audio asset are scaled by the same value. 

bEnblPerChMainAudioScale (Scaling Type for Channels of Main Audio) 

This field is present in one of the two cases, if: 

1) bMixMetadataPresent is true; or 

2) bOnetoOneMixingFlag is true. 

The above two cases are mutually exclusive since bMixMetadataPresent=true implies that bOnetoOneMixingFlag=false 
and also bOnetoOneMixingFlag=true implies that bMixMetadataPresent=false. 

If bEnblPerChMainAudioScale is false, a single scale factor is present for all main audio channels that are being mixed 
with the channels of this audio asset. If bEnblPerChMainAudioScale is true, each main audio channel that is being 
mixed with the channels of the current audio asset has its own scale factor. 

nuMainAudioScaleCode (Scaling Parameters of Main Audio) 

This is present only if bMixMetadataPresent is true or if bOnetoOneMixingFlag is true for each mixing configuration 
independently. 
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It modifies the gain of the audio program that is being mixed with the current audio program. The number of main 
audio scale factors is equal to the number of mixer output channels that are defined for a particular mixing 
configuration. This number is derived by looking at Table 7-10, cumulatively adding the entries in column denoted by 
"Number of Channels" for each set ("1") bit of nuMixOutChMask. The ordering of the coefficients is derived by 
checking the bits that are set in the nuMixOutChMask, starting from the LSB and by looking into Table 7-10 for the 
corresponding channel labels. In case of the channel label that indicates a channel pair, the coefficient for the left 
channel of the pair comes first. For example the nuSpkrActivityMask = 0x2F indicates a 7. 1 

(C+LR+L s R s +LFEl+L h R h ) mixer configuration and consequently the main audio scale factors for this configuration 
will be ordered as C, L, R, L s , R s , LFEj, L h and R h . The scale factors are obtained from the extracted 6-bit codes 
(nuMainAudioScaleCode) according to the procedure described in clause C.5. When nuMainAudioScaleCode is 
updated, the transition from the previous value to the new value is described in clause C.6. 

nuMixMapMask (Mix Output Mask) 

This is present only if bMixMetadataPresent is true, for each of the output mixing configurations, for each embedded 
downmix configuration present in the current audio asset and for each channel of a specific embedded downmix 
configuration. This mask defines the mixing map from each channel of current audio asset to output mixer. Note that, in 
this context, the full mix is categorized as one of the embedded downmix configurations with the number of channels 
equal to the nuTotalNumChs. 

The number of mixer output channels that are defined for a particular mixing configuration is derived from the 
nuMixOutChMask. Each channel of the current audio asset may be mapped to any of the mixer output channels using 
the mixing coefficients described in nuMixCoeffs . 

The nuMixMapMask has a dedicated bit (channel flag) for each nNumMixOutCh mixer output channel, keeping them 
in the same order as defined by the nuMixOutChMask. In particular the order of channel flags in the nuMixMapMask, 
when starting from the LSB, is derived by checking the bits that are set in the nuMixOutChMask starting from the LSB 
and by looking into Table 7-10 for the corresponding channel labels. In case of the channel label that indicates a 
channel pair, the channel flag for the left channel of the pair comes first. 

A channel of the current audio asset is mapped exclusively to those mixer output channels that have their bits in 
nuMixMapMask set to "1". Mapping to all other output channels assumes mixing coefficient equal to -co dB. 

nuMixCoeffs (Mixing Coefficients) 

This is present only if bMixMetadataPresent is true, for each of the output mixing configurations, each embedded 
downmix configuration present in the current audio asset and each channel of a specific embedded downmix 
configuration. Note that, in this context, the full mix is categorized as one of the embedded downmix configurations, 
with the number of channels equal to the nuTotalNumChs. 

The scale factors are obtained from the extracted 6-bit codes according to the procedure that is described in clause C.5. 
The transition from the previous value to the new value when nuMixCoeffs is updated is described in clause C.6. 

7.4.2.3 Decoder Navigation Data 

nuCodingMode (Coding Mode for the Asset) 

This is a 2-bit field that represents an index into a look-up table. The look-up table, Table 7-14, describes the audio 
coding modes that may be used for compression of audio in the asset. 



Table 7-14: Coding Mode 



nuCodingMode 


Description 





DTS-HD Coding Mode that may contain multiple coding components 


1 


DTS-HD Loss-less coding mode without CBR component 


2 


DTS-HD Low bit-rate mode 


3 


Auxiliary coding mode 



The auxiliary coding mode is reserved for future applications. 
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nuCoreExtensionMask (Coding Components Used in Asset) 

This field is present only when nuCodingMode = 0. This is an array of 12 bits indicating the core and the extensions 
that are used for coding the current audio asset. A '1' in the mask at the bit location reserved for a specific coding 
component indicates that the particular coding component is used in this audio asset. A '0' in the mask at the bit location 
reserved for a specific coding component indicates that the particular coding component is NOT used in this audio 
asset. The association of bit locations within nuCoreExtensionMask to the specific coding components is described in, 
Table 7-15, shown below. Data for each substream type within an audio asset will appear in the same order (LSB first) 
as the bit mask position. 



Table 7-15: Core/Extension Mask 



Notation 


Core/Extension Type Description 


CBR or 
VBR 


nuCoreExtensionMask 


DTS CORESUB STREAM COR 
E 


Core component within the core 

qi ihQtrpa m 

oUUoLI Cdl 1 1 


CBR 


0x001 


DTS_BCCORE_XXCH 


XXCH extension, when combined 
with P99nriatpH rnrp mav hp cipnt 

Willi aoouoiQLCU ^ui c, i i lay u c; oci u 

via SPDIF for backward 
oomnatibilitv 


CBR 


0x002 


DTS_ BCCORE _X96 


X96 extension, when combined with 
associated core, may be sent via 
SPDIF for backward compatibility 


CBR 


0x004 


DTS_ BCCORE_XCH 


XCH extension, when combined 
with associated core, may be sent 
via SPDIF for backward 
compatibility 


CBR 


0x008 


DTS_EXSUB_STREAM_CORE 


Core component within the current 
extension substream 


CBR 


0x010 


DTS_EXSUB_STREAM_XBR 


XBR extension within the current 
extension substream 


CBR 


0x020 


DTS_EXSUB_STREAM_XXCH 


XXCH extension within the current 
extension substream 


CBR 


0x040 


DTS_EXSUB_STREAM_X96 


X96 extension within the current 
extension substream 


CBR 


0x080 


DTS_EXSUB_STREAM_LBR 


Low bit rate component within the 
current extension substream 


CBR 


0x100 


DTS_EXSUB_STREAM_XLL 


Lossless extension within the 
current extension substream 


VBR 


0x200 


RESERVED 1 


0x400 


RESERVED2 


0x800 



The bit locations 0x400 and 0x800 within the nuCoreExtensionMask are reserved for future use and all the decoders 
that comply with this version of the specification shall ignore all coding components that have any of these 2 bits set to 

tt ^ tt 

nuExSSCoreFsize (Size of Core Component in Extension Substream) 

This field is present only when nuCodingMode = and the DTS_EXSUB_STREAM_CORE bit of 
nuCoreExtensionMask is set to "1". 

This field indicates per frame payload size, in bytes, of a core component of the audio asset present in the extension 
substream. When the asset consists of core and extension(s) that are carried in the extension substream, the 
nuExSSCoreFsize is used to navigate to the location of a first extension as indicated by the nuCoreExtensionMask. 

bExSSCoreSyncPresent (Core Sync Word Present Flag) 

This field is present only when nuCodingMode = and the DTS_EXSUB_STREAM_CORE bit of 
nuCoreExtensionMask is set to "1". 

If bExSSCoreSyncPresent is true, the sync word for the associated core component is present. This indicates to the asset 
decoder that it should attempt to establish/verify synchronization in the current frame. 
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If bExSSCoreSyncPresent is false, no core sync word is present in the current frame of extension substream. If the asset 
decoder is not synchronized, it is not able to establish synchronization. If the asset decoder is synchronized, it consumes 
the nuExSSCoreFsize bytes of the core data. 

nuExSSCoreSyncDistlnFrames (Core Sync Distance) 

This 2-bit field is present only if bExSSCoreSyncPresent is true. The nuExSSCoreSyncDistlnFrames represents the 
distance between the two sync words in the core stream of the current asset measured in number of extension substream 
frames. The nuExSSCoreSyncDistlnFrames takes values 1, 2, 4 and 8, which are obtained from the transmitted 2-bit 
codes 0, 1, 2 and 3 respectively. 

nuExSSXBRFsize (Size of XBR Extension in Extension Substream) 

This field is present only when nuCodingMode = and the DTS_EXSUB_STREAM_XBR bit of 
nuCoreExtensionMask is set to "1". 

This field indicates per frame payload size, in bytes, of XBR extension of the audio asset present in the extension 
substream. When the asset contains additional extensions that follow the XBR extension in the extension substream, the 
nuExSSXBRFsize is used to navigate to the location of the next extension as indicated by the nuCoreExtensionMask. 

nuExSSXXChFsize (Size of XXCH Extension in Extension Substream) 

This field is present only when nuCodingMode = and DTS_EXSUB_STREAM_XXCH in nuCoreExtensionMask is 
set to "1". 

This field indicates per frame payload size, in bytes, of XXCH extension of the audio asset present in the extension 
substream. When the asset contains additional extensions that follow the XXCH extension in the extension substream, 
the nuExSSXXCHFsize is used to navigate to the location of the next extension as indicated by the 
nuCoreExtensionMask. 

nuExSSX96Fsize (Size of X96 Extension in Extension Substream) 

This field is present only when nuCodingMode = and the DTS_EXSUB_STREAM_X96 bit of nuCoreExtensionMask 
is set to "1". 

This field indicates per frame payload size, in bytes, of X96 extension of the audio asset present in the extension 
substream. When the asset contains additional extensions that follow the X96 extension in the extension substream, the 
nuExSSX96Fsize is used to navigate to the location of the next extension as indicated by the nuCoreExtensionMask. 

nuExSSLBRFsize (Size of LBR Component in Extension Substream) 

This field is present when: 

• nuCodingMode = and DTS_EXSUB_STREAM_LBR in nuCoreExtensionMask is set to " 1" ; or 

• nuCodingMode = 2, which indicates that the asset is coded using only a low bit rate component. 

This field indicates per frame payload size, in bytes, of LBR component of the audio asset present in the extension 
substream. When the asset contains additional extensions that follow the LBR component in the extension substream, 
the nuExSSLBRFsize is used to navigate to the location of the next extension as indicated by the 
nuCoreExtensionMask. 

bExSSLBRSyncPresent (LBR Sync Word Present Flag) 

This field is present when: 

• nuCodingMode = and DTS_EXSUB_STREAM_LBR in nuCoreExtensionMask is set to " 1 " ; or 

• nuCodingMode = 2, indicating the asset is coded using only a low bit rate component. 

If bExSSLBRSyncPresent is true for an asset in the current frame of the extension substream, the sync word for the 
associated LBR component is present. This indicates to the asset decoder that it should attempt to establish/verify 
synchronization in the current frame. 
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If bExSSLBRSyncPresent is false for an asset, no LBR sync word is present in the current frame of extension substream 
and the asset decoder is not capable, when starting from unsynchronized state, to establish the synchronization with the 
LBR data present in this frame. If the asset decoder is in a synchronized state, it consumes the nuExSSLBRFsize bytes 
of the LBR data. 

nuExSSLBRSyncDistlnFrames (LBR Sync Distance) 

This 2-bit field is present only if bExSSLBRSyncPresent is true. The nuExSSLBRSyncDistlnFrames represents the 
distance between the two sync words in the LBR stream of the current asset measured in number of extension substream 
frames. The nuExSSLBRSyncDistlnFrames takes values 1, 2, 4 and 8, which are obtained from the transmitted 
2-bit codes 0, 1, 2 and 3 respectively. 

nuExSSXLLFsize (Size of XLL Data in Extension Substream) 

This field is present when either: 

• nuCodingMode = and DTS_EXSUB_STREAM_XLL in nuCoreExtensionMask is set to " 1" ; or 

• nuCodingMode = 1, which indicates that asset is coded using only a lossless component. 

This field indicates the size, in bytes, of XLL extension of the audio asset present in the extension substream. 
bExSSXLLSyncPresent (XLL Sync Word Present Flag) 
This field is present when either: 

• nuCodingMode = and DTS_EXSUB_STREAM_XLL in nuCoreExtensionMask is set to " 1 " ; or 

• nuCodingMode = 1, which indicates that asset is coded using only a lossless component. 

If the bExSSXLLSyncPresent is true for an asset in current frame of the extension substream, the sync word for the 
associated XLL component is present. This indicates to the asset decoder that it should attempt to establish/verify 
synchronization in the current frame. 

If bExSSXLLSyncPresent is false for an asset, no XLL sync word is present in the current frame of extension substream 
and the asset decoder is not capable, when starting from unsynchronized state, to establish the synchronization with the 
XLL data present in this frame. If the asset decoder is in a synchronized state it consumes the nuExSSXLLFsize bytes 
of the XLL data. 

nuPeakBRCntrlBuffSzkB (Peak bit rate smoothing buffer size) 

This field is present when bExSSXLLSyncPresent is true and when either: 

• nuCodingMode = and DTS_EXSUB_STREAM_XLL in nuCoreExtensionMask is set to " 1" ; or 

• nuCodingMode = 1 . 

This field represents the size, in kBytes, of the peak bit rate smoothing buffer, which has been assumed to exist on the 
decode side during a lossless encoding/authoring of the current asset. 

The available sizes are from Kbyte to 240 kBytes in steps of 16 kBytes. 

Each lossless encoded asset has its corresponding smoothing buffer and nominally the sum of buffer sizes for all active 
assets shall not exceed the specified decoder's buffer size. However, next-generation streams may be created such that 
for a certain subset of all active assets, all decoders meet the buffer size requirements, but for decoding of all active 
assets, the new generation decoders are required. Decoders capable of decoding only a subset of all active assets 
(because of limitation in available buffer size) shall ignore all other assets that do not belong to the specified subset. 
This subset consists of the lowest index active assets that jointly do not require more than available decoder buffer size. 

nuBitsInitDecDly (Size of field nuInitLLDecDlyFrames) 

This field is present when bExSSXLLSyncPresent is true and when either: 

• nuCodingMode = and DTS_EXSUB_STREAM_XLL in nuCoreExtensionMask is set to " 1" ; or 

• nuCodingMode = 1 . 
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The nuBitsInitDecDly is the number of bits used to extract the parameter nuInitLLDecDlyFrames (Initial XLL 
Decoding Delay in Frames). 

nuInitLLDecDlyFrames (Initial XLL Decoding Delay in Frames) 

This field is present when bExSSXLLSyncPresent is true and when either: 

• nuCodingMode = and DTS_EXSUB_STREAM_XLL in nuCoreExtensionMask is set to " 1 " ; or 

• nuCodingMode = 1 . 

The nuInitLLDecDlyFrames is the number of frames to delay lossless decoding of the current asset after the initial 
synchronization is established. It instructs the decoder to wait 'nuInitLLDecDlyFrames' frames before decoding the first 
frame of the current asset, after establishing/re-establishing the synchronization. For all consecutive frames, as long as 
the decoder is synchronized, the decoder ignores the nuInitLLDecDlyFrames. 

This value tells the decoder how many frames the lossless frame in question needs to be delayed until it can be decoded. 
The frame is placed into the decoder's buffer and interprets the offset as a time stamp of the delay. The decoder receives 
the data, detects the offset number and does not decode the frame until the specified offset number of frames intervals 
has elapsed. 

The frame offset specifies a schedule of when the data is in the buffer and when it is to be decoded. If the stream 
consists of both lossy and lossless substreams the decoder shall decode lossy data and output the lossy decoded audio 
while waiting for 'nuInitLLDecDlyFrames' delay to expire. If the stream consists of lossless data only, decoder outputs 
shall be muted until the 'nuInitLLDecDlyFrames' delay expires. 

nuExSSXLLSyncOffset (Number of Bytes Offset to XLL Sync) 

This field is present when bExSSXLLSyncPresent is true and when either: 

• nuCodingMode = and DTS_EXSUB_STREAM_XLL in nuCoreExtensionMask is set to " 1 " ; or 

• nuCodingMode = 1 . 

This specifies the number of bytes offset (from start of XLL data in current asset) to locate the first XLL sync word in 
the current asset. 

nuExSSAuxFsize (Size of Auxiliary Coded Data) 

This field is present only when nuCodingMode = 3. 

This field indicates the size, in bytes, of auxiliary coded data present in the extension substream for the current audio 
asset. 

nuAuxCodecID (Auxiliary Codec Identification) 

This 8-bit field is present only if nuCodingMode = 3. Its value represents an index into the auxiliary codec lookup table. 
This feature is not supported in the current version of DTS-HD. 

bExSSAuxSyncPresent (Aux Sync Word Present Flag) 

This field is present only when nuCodingMode = 3. 

If the bExSSAuxSyncPresent is true for an asset in the current frame of the extension substream, the sync word for the 
associated auxiliary component is present. This indicates to the asset decoder that it should attempt to establish/verify 
synchronization in the current frame. 

If bExSSAuxSyncPresent is false for an asset, no auxiliary sync word is present in the current frame of extension 
substream and the asset decoder is not capable, when starting from unsynchronized state, to establish the 
synchronization with the auxiliary coded data present in this frame. If the asset decoder is in a synchronized state, it 
consumes the nuExSSAuxFsize bytes of the auxiliary coded data. 
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nuExSSAuxSyncDistlnFrames (Aux Sync Distance) 

This 3-bit field is present only if bExSSAuxSyncPresent is true. The nuExSSAuxSyncDistlnFrames represents the 
distance between the two sync words in the auxiliary stream of the current asset measured in number of extension 
substream frames. The nuExSSAuxSyncDistlnFrames takes values in a range from 1 to 8. 

nuDTSHDStreamID (DTS-HD Stream ID) 

This 3-bit field is present only if the XLL coding component is present in the asset. It represents the unique ID number 
(ranging from to 7) of a DTS-HD stream that carries the asset data. All assets within the same DTS-HD stream would 
have the same value for the nuDTSHDStreamID. This parameter is used to indicate to the decoder that seamless 
switching between the two DTS-HD streams, both carrying an asset with the XLL coding components, has occurred in 
the player. 

If the value of nuDTSHDStreamID changes from the value present in the previous frame, the XLL stream decoding is 
disabled in the current (transition) frame and the XLL receive and output buffers are zeroed out. 

When the lossy component is used together with the XLL extension, the decoding of the lossy stream is uninterrupted. 
The audio decoded from the lossy component of the new stream will be played out during the transition frame. The 
XLL decoding will continue in the following frame according to the buffering delay obtained from the new stream. 
After the expiration of imposed delay on the XLL decoding, both the lossy and the XLL components are decoded and 
combined together to generate lossless decoded audio. 

When the XLL is used in standalone mode (without lossy component), the audio will be muted in the transition frame 
and the audio in the frame after the transition frame will be faded in. 

bOnetoOneMixingFlag 

This flag is present only when bOne20neMapChannels2Speakers and bMixMetadataEnbl are true and 
bMixMetadataPresent is false. 

This flag when true indicates a simplistic asset mixing scenario in which the current audio asset channels are directly 
added to the corresponding audio asset, with appropriate scaling of the primary audio channels or appropriate scaling of 
the current audio asset channels depending upon the bDecodeAssetlnSecondaryDecoder flag. 

When bOnetoOneMixingFlag is true and bDecodeAssetlnSecondaryDecoder is true, the simplistic asset mixing 
scenario has the current audio asset channels directly added to the corresponding primary audio channels with 
appropriate scaling of the primary audio channels. In the case of a 5.1 primary audio asset and a 2.0 secondary audio 
asset, the left secondary audio channel will only be added to the left primary audio channel. Similarly, the right 
secondary audio channel will only be added to the right primary audio channel. Primary audio channels may be 
individually scaled prior to the addition of secondary audio channels using the coefficients that are obtained from the 
codes (nuMainAudioScaleCode) transmitted immediately after the bOnetoOneMixingFlag. 

When bOnetoOneMixingFlag is true and bDecodeAssetlnSecondaryDecoder is false, the simplistic asset mixing 
scenario has the current audio asset channels directly added to corresponding audio channels of all active assets with 
appropriate scaling of the current audio channels. In the case of active audio assets with 5.1 channels, the left audio 
channel of the current asset will only be added to the left audio channel of other active assets. Similarly, the right audio 
channel of the current asset will only be added to the right audio channel of other active assets. The current audio 
channels may be individually scaled prior to the addition with other active asset audio channels using the coefficients 
that are obtained from the codes (nuMainAudioScaleCode) transmitted immediately after the bOnetoOneMixingFlag. 

The mixing scenario described above minimizes the amount of mixing metadata that needs to be transmitted. 

bDecodeAssetlnSecondaryDecoder 

This flag when true indicates that the current asset is to be decoded by the secondary decoder. In particular the primary 
decoder shall be able to decode at least one active audio asset (with bDecodeAssetlnSecondaryDecoder =false) and the 
secondary decoder decodes up to one active audio asset. This is a guarantee that the solutions implementing both 
primary and secondary decoders will at least be able to decode two audio assets. It is the responsibility of the external 
post-process to combine the audio obtained from the primary and secondary decoder. This external process is either 
controlled by the external metadata or by the metadata present in the audio asset with 

bDecodeAssetlnSecondaryDecoder = true and bExternalMixFlag = true. In later case the external process will request 
the metadata from the secondary decoder. 
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bDRCMetadataRev2Present 

When the bDRCMetaDataRev2Present flag is TRUE, the Rev2 DRC metadata that enables dynamic range control with 
a time resolution of one value for every period of 256*RefClockPeriod seconds, will be present. This Rev2 DRC 
metadata shall be used for controlling the dynamic range of all decoded channels in the current asset instead of: 

• the nuDRCCode and the nuDRC2ChDmixCode metadata that may be present in the current asset descriptor; or 

• the subsubFrameDRC_Rev2AUX that may be present in a DTS core associated with the current asset. 

If the bDRCMetaDataRev2Present flag is FALSE, no Rev2 DRC metadata is present in the current asset descriptor. 
DRCversion_Rev2 

This field will be present only if bDRCMetaDataRev2Present flag is TRUE. The DRCversion_Rev2 is a four bit field 
which is used to determine the version of DRC algorithm which the encoder used. The first version starts at 0x1. 
Decoders will support DRC version 0x1 up to the latest version which they support. Currently only DRCversion_Rev2 
= 1 is supported. If the encoder is supplying DRC information with a version number higher than that which is 
supported by the decoder, the supplied Rev2 DRC values should be ignored and no DRC should be applied. 

DRCCoeff_Rev2 

This field will be present only if bDRCMetaDataRev2Present flag is TRUE. Currently only DRC version 1 is 
supported, which is single band mode. In single band mode, one 8 bit value is transmitted every 256*RefClockPeriod 
seconds. Consequently in a single band mode in each frame there will be nuExSSFrameDurationCode/256 values for 
DRCCoeff_Rev2. 

Each 8-bit value for DRCCoeff_Rev2 is extracted from the bitstream and converted into a dB gain by function 
dts_dynrng_to_db(). 

Reserved (Reserved) 

This field is reserved for extending the information present in the audio asset descriptor. The decoder shall assume that 
this field is present and of unspecified duration. Therefore, in order to continue unpacking the stream, the decoder shall 
skip over this field using the extension substream header start pointer and the audio asset descriptor size 
nuAssetDescriptFsize. 

ZeroPadForFsize (Make nuAssetDescriptFsize a multiple of 8 bits) 

This field ensures that the size of an audio asset descriptor is an integer number of bytes. Encoder appends '0's until the 
current bit position relative to the first packed bit of nuAssetDescriptFsize is a multiple of 8 bits. 



The DTS lossless coding extension (XLL) results in a bit for bit accurate reproduction of the input signal. The lossless 
encoding system has provisions to support a maximum audio sample rate of 384 kHz and the capability to support high 
channel counts when used in the DTS-HD framework, which also permits nearly arbitrary speaker mapping. 

Since this is a bit accurate system, XLL may be used alone, or it may be used as a residual coder in conjunction with a 
lossy coding system, as in the current commercial application of DTS-HD Master Audio ™. This clause describes the 
XLL extension. 



As depicted in Figure 8-1, an XLL lossless frame consists of a header and one or more frequency bands. Furthermore, 
each frequency band consists of one or more data segments. The number of data segments in each frequency band is the 
same. Time duration of each segment is the same for all segments in the frame. 



8 



DTS Lossless Extension 




8.1 



Lossless Frame Structure 
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Segment 



Segment q 



Frequency 
Band p 



Figure 8-1 : XLL Lossless Frame 

Both the header and the segments are each further sub-divided by the channel set information. The channel set 
information is a representation of the source material in either downmixed or unmixed form. Similar to the segments, 
each expresses the same duration in seconds, but not necessarily the same number of samples, because each channel set 
can represent different sample rates. 

By way of example, in a given segment the first set (ChSET ) could represent 5.1 downmixed primary channels with 
the second set (ChSETj) representing two extra ES channels for reversing the downmix to define a 7.1 discrete 
environment. 



8.1.1 Header Structure 

As shown in Figure 8-2, the lossless frame header consists of a sync word, common setup data (such as the number of 
frequency bands, the number of segments in the frame, the number of samples in each segment), information specific to 
each of the channel sets (such as number of channels, channel layout and sampling frequency) and the navigation 
indices. For example, a frame containing two channel sets consisting of a primary 5.1 set and an ES channel set would 
be specified by data consisting of the common header field plus two specific channel set sub-headers and a navigation 
index (NA VI) table. 



S 
Y 
N 
C 



Common 



SUB- 
HEADER 
ChSET 



SUB- 
HEADER 

ChSET( M ) 



HEADER 



NAVI 



Figure 8-2: XLL Header Structure 



8.1.1.1 Common Header 

The common header area indicates the header size, which helps locate the first channel set sub-header and the checksum 
field. It also contains flags indicating if a core is present, if the core should be interpolated, the total size of the entire 
frame and a frame offset designating the number of frames by which to delay the decoding in order to facilitate buffer 
management at the decoder. The common header also includes the number of segments per frame and the number of 
samples in each segment of channel set 1 . (All other channel sets scale the number of samples in a segment using the 
ratio of their sampling frequency and the sampling frequency of the first channel set). 

A checksum is included at the end of the common header as a verification measure and to help detect erroneous sync 
word caused by alias occurrences of the sync word. Since aliases of the sync word are a concern, the decoder shall 
always calculate the header checksum fields to verify the validity of the detected sync word. To this end, when a valid 
checksum pattern is discovered, decoding can begin immediately. See Annex B for a complete discussion on cycle 
redundancy checking (CRC). 
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8.1 .2 Channel Set Sub-Header 

Specific header data for each channel set is stored within the channel set sub-header. The channel set sub-header field 
includes its size, the number of channels in the set, the sampling rate and specific data, such as the replacement flag and 
downmixing coefficients. 

The first channel set sub-header is read by indexing its start position, which is reached by computing the offset frame 
start position plus the size of the common header. From this position the sub-header data for the first channel set is 
unpacked. The next channel set sub-header is located by extracting the mChSetHeaderFSize of the current channel 
set and advancing by mChSetHeaderFSize bytes from the start of the current channel set. 

8.1 .3 Navigation Index 

The NAVI table consists of the sizes of individual frequency bands. It is described in more detail in clause 8.3.1. 

8.1 .4 Frequency Band Structure 

The frequency band is considered to be a base band and it is always present in the stream. Up to three extended 
frequency bands may also be present in the stream, supporting the sampling frequency of up to four times the sampling 
frequency of the base band. For example, all channel sets that have the sampling frequency less than or equal to 96 kHz 
are coded entirely within the frequency band 0. The channel sets that have a sampling frequency of 192 kHz are coded 
using both the frequency band and the frequency band 1 . Finally, the channel sets that have a sampling frequency of 
384 kHz are coded using all frequency bands: 0, 1, 2 and 3. 

The number of extended frequency bands is part of the header information. After adding the number of extended bands, 
the total number of frequency bands can be 1,2, or 4. On the encode side, the number of frequency bands is determined 
simply by the underlying maximum sampling frequency among all of the channel sets. In particular, for sampling 
frequency F s the number of encoded frequency bands is determined as follows: 

• Number of frequency bands is 1 for F s < Base_F s 

• Number of frequency bands is 2 for Base_F s < F s < 2 x Base_F s 

• Number of frequency bands is 4 for 2 x Base_Fs < Fs < 4 x Base_Fs 
where Base_F s denotes the base sampling frequency i.e. 64 kHz, 88.2 kHz, or 96 kHz. 

A decoder that supports only sampling frequencies up to the Base_F s shall decode only the frequency band and skip 
over the remaining bands. Similarly, a decoder that supports only sampling frequencies up to the 2xBase_F s shall 
decode only the frequency bands and 1 and skip over the remaining bands. 

8.1 .5 Segments and Channel Sets 

The segments and channel sets further subdivide the frequency band data. A segment within a frequency band contains 
the encoded frequency band samples, over a specific period of time (segment duration), for all channel sets present in 
the frequency band. 

As shown in Figure 8-3, the encoded and packed channel set data are interleaved into a segment. There are no headers 
associated with these two levels of abstraction. A segment represents the same time duration for all channels sets when 
unpacked. 





ChSETo 




ChSET, 




ChSETo) 






SEGMENT 













Figure 8-3: Segment with Encoded and Packed Channel Sets 
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8.2 Lossless Stream Syntax 
8.2.1 Common Header 

Table 8-1 describes the composition of the common header. 



Table 8-1 : Common Header 



Syntsx 


oize (diisj 


SYNCXLL = ExtractBits (32) ; 


32 


nVersion = ExtractBits (4) + 1; 


4 


nHeaderSize = ExtractBits ( 8 ) + 1 ; 


8 


nBits4FrameFsize = ExtractBits ( 5 ) + 1 


5 


nLLFrameSize = ExtractBits (nBits4FrameFsize) + 1 ; 


nBits4FrameFsize 


nNumChSetsInFrame = ExtractBits (4) + 1 ; 


4 


tmp = ExtractBits (4) ; 


4 


nSegment s InFrame = 1 << tmp; 




tmp = ExtractBits (4) ; 


4 


nSmplInSeg = 1 << tmp; 




nBits4SSize = ExtractBits ( 5 ) + 1; 


5 


nBandDataCRCEn = ExtractBits ( 2 ) ; 


2 


bScalableLSBs = ExtractBits (1) ; 


1 


nBits4ChMask = ExtractBits ( 5 ) + 1; 


5 


if (bScalableLSBs) 




nuFixedLSBWidth = ExtractBits (4) ; 


4 


Reserved = ExtractBits (...) ; 




ByteAlign = ExtractBits ( ... 7) 


0...7 


nCRC16Header = ExtractBits ( 16 ) ; 


16 



SYNCXLL (XLL extension sync word) 

The lossless DWORD aligned synchronization word has value 0x41A29547. During sync detection the nCRC16Header 
checksum is used to further verify that the detected sync pattern is not a random alias. The DWORD alignment makes it 
necessary to append a total of from 1 to 3 extra zero bytes after the last band data of the previous frame. 

nVersion (Version number) 

This is the lossless stream syntax version identification. If the version indicated here is greater than the version of the 
decoder, the decoder should not attempt to process the stream further. 

nHeaderSize (Lossless frame header length) 

This is the size of the common header in bytes from the SYNCXLL to nCRC16Header inclusive. This value determines 
the location of the first channel set header. This marker also designates the end of the field nCRC16Header and allows 
quick location of the checksum at byte position nHeaderSize-2. 

nBits4FrameFsize (Size of field nBytesFrameLL) 

This is the number of bits less one used to store the lossless frame size parameter nLLFrameSize. 
nLLFrameSize (Number of bytes in a lossless frame) 

This is the total number of bytes in a lossless frame. This value is used to traverse to the end of the frame. 
nNumChSetsInFrame (Number of Channel Sets per Frame) 

This is the number of channel sets. It is used to loop through the channel set sub-headers, the channel sets in each 
segment and the NAVI index. 

nSegmentsInFrame (Number of Segments per Frame) 

This is the number of segments in the current frame. Representing a binary exponent, the actual segment count is 1 left 
shifted by the extracted value. 
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nSmplInSeg (Samples in a segment per one frequency band for the first channel set) 

This is the number of samples in a segment per one frequency band in the first channel set. Representing a binary 
exponent, the actual number of samples in a segment is 1 left shifted by the extracted value. All subsequent channel sets 
determine their nSmplInSeg by first determining the sampling frequency scaling factor against the first channel set. For 
example, assume: 

• the first channel set with the sampling rate Fsl and the number of frequency bands equal to 
m_nNumFreqBandsl (see page 124 for the definition of m_nNumFreqBands); 

• the second channel set with the sampling rate Fs2 and the number of frequency bands equal to 
m_nNumFreqBands2; 

• then the second channel set will have the number of segment samples per frequency band equal to: 

„ „ Fs2xm nNumFreqBandsl 

nSmplInSegx = 

Fs 1 x m_nNumFreqBands2 

In order to control the required audio output buffer size, the maximum number of samples in a segment per each 
frequency band is limited as follows: 

• Maximum nSmplInSeg is 256 for sampling frequencies Fs < 48 kHz. 

• Maximum nSmplInSeg is 5 12 for sampling frequencies Fs > 48 kHz. 

NOTE: Notice that for sampling frequencies greater than 96 kHz, the data is split uniformly into 2 frequency 
bands (for 96 kHz < Fs < 192 kHz) or into 4 frequency bands (for 192 kHz < Fs < 384 kHz). Therefore 
the maximum sampling frequency in one frequency band is 96 kHz and consequently the maximum 
nSmplInSeg is the same for all sampling frequencies Fs>48 kHz. 

nBits4SSize (Number of bits used to read segment size) 

nBits4SSize+l is the bit size of the field that contains the size of all data fields in the NAVI table. 
nBandDataCRCEn (Presence of CRC16 within each frequency band) 

When set, this field indicates that checksums were embedded in the frequency band data. This field shall be decoded as 
follows: 



Table 8-2: CRC Presence in Frequency Band 



nBandDataCRCEn 


CRC Presence 


00 


No CRC1 6 within band 


01 


CRC1 6 placed at the end of MSB0 


10 


CRC1 6 placed at end of MSB0 and at end of LSB0 


11 


CRC placed at end of MSB0 and at end of LSB0 and at the end of 
bands 1 ,2 and 3 where they exist 



bScalableLSBs (MSB/LSB split flag) 

This indicates whether MSB/LSB split has been enabled. When bScalableLSBs is true, the MSB/LSB split has been 
performed in frequency band 0. The MSB/LSB split in extension frequency band nBand has been performed only when 
both bScalableLSBs and bMSBLSBSplitEnInExtBands[nBand] are true. 

bOne20neMapChannels2Speakers (Channels to speakers mapping type flag) 

This flag is transmitted as a part of program descriptor within the extension substream header and its definition is 
repeated here for reference. This flag, if true, indicates that each encoded channel represents a signal feed to a 
corresponding loudspeaker on the decode site. If false, it indicates that channels carry the signals that describe the sound 
field but are not the actual loudspeaker feeds. The actual loudspeaker feeds are derived on the decode side using the 
stream embedded coefficients and possibly user-provided adjustment factors. One example of using this mode of 
operation is to carry Ambisonic first-order signals (B-format) W, X, Y and Z. 
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m_RepresentationType (Representation Type) 

This field is transmitted as a part of program descriptor within the extension substream header and its definition is 
repeated here for reference. 

The field is present if bOne20neMapChannels2Speakers is false. It describes the type of representation according to the 
table below. This information may be useful in some post processing tasks. The decoder shall export this information to 
post-processing functions. 



Table 8-3: Representation Types 



m_RepresentationType 


Description 


ObOOO 


Audio Asset for Mixing/Replacement 


0b001 


Reserved 


0b010 


Lt/Rt Encoded for matrix surround decoding implies nTotalNumCHs=2 


0b011 


U/Rh Audio processed for headphone playback implies nTotalNumCHs=2 


0b100 


Reserved 


0b101 - 0b1 1 1 


Reserved 



nBits4ChMask (Channel Position Mask) 

This is the number of bits used to extract the channel mask for each channel set. 'OR'-ing individual channel masks for 
all channel sets yields the overall channel mask for the frame. 

nuFixedLSB Width (MSB/LSB split) 

This field only exists if bScalableLSBs is TRUE. 

• if nuFixedLSBWidth = then length of the LSB part is variable according to pnScalableLSBs[nBand][nCh]; 

• if nuFixedLSBWidth > then length of the LSB part is fixed and equal to nuFixedLSBWidth. 

In both cases the pnScalableLSBs[nBand][nCh] indicates the number of bits used to pack the binary codes representing 
the samples of LSB part. 

Reserved (Reserved) 

This is reserved for supplemental header information. The decoder shall assume that this field is present and of 
unspecified duration. Therefore, in order to continue unpacking the stream, the decoder shall skip over this field using 
the header start pointer and the header size nHeaderSize. 

nCRC16Header (Header CRC16 Protection) 

CRC16 is calculated for the header from positions VersionNum to Byte Align inclusive. 

8.2.2 Channel Set Sub-Header 



Table 8-4: Channel Set Sub-Header 



Syntax 


Size (Bits) 


// Unpack the header size 

nByteOffset += m nChSetHeaderSize = ExtractBits { 10 ) + 1; 

// Extract the number of channels 

m nChSetLLChannel = ExtractBits (4 ) + 1; 

// Extract the residual channel encoding 

m nResidualChEncode = (DTS int64) ExtractBits (m nChSetLLChannel); 

// Extract the input sample bit-width 

m nBitResolution = ExtractBits { 5 ) + 1; 

// Extract the original input sample bit-width 

m nBitWidth = ExtractBits ( 5 ) + 1; 

// Extract the sampling frequency index 

sFreqlndex = ExtractBits (4 ) ; 

// Find the actual sampling frequency 

m nFs = m pnFsTbl [sFreqlndex] ; 

// Extract nFs interpolation multiplier 
m nFsInterpolate = ExtractBits ( 2 ) ; 


10 
4 

nChSetLLChannel 
5 
5 

4 

2 
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Syntax 



Size (Bits) 



// Extract the replacement channel set group 
m nReplacementSet = ExtractBits (2 ) ; 

// Extract the active replacement channel set flag 
if (m_nReplacementSet > 0) 



m bActiveReplaceSet = (ExtractBits (1) 
if ( ImbActiveReplaceSet) 

m_bSkipDecode = true; 



== 1) ? true 



false ; 



// Downmix is allowed only when the encoded channel represents a signal // 
feed to a corresponding loudspeaker (bOne20neMapChannels2Speakers=true ) 
if (m_bOne20neMapChannels2Speakers) { 

// Extract the primary channel set flag 

m_bPrimaryChSet = (ExtractBits (1) == 1) ? true : false; 
// Extract the downmix flag 

m_bDownmixCoef f CodeEmbedded = (ExtractBits (1) == 1) ? true : false; 
// Extract the Embedded Downmix flag 
if (m_bDownmixCoeff CodeEmbedded) 

m_bDownmixEmbedded = (ExtractBits (1) == 1) ? true : false; 
// Extract the Downmix type 

if (m_bDownmixCoeff CodeEmbedded && m_bPrimaryChSet ) 

m_nLLDownmixType = ExtractBits ( 3 ) ; 
// Extract the hierarchical channel set flag 
m_bHierChSet = (ExtractBits (1) == 1) ? true : false; 
if (bDownmixCoeff CodeEmbedded) 

DownmixCoef f s = ExtractBits (nDownmixCoeffs * 9); 
bChMaskEnabled = (ExtractBits (1) ==1) ? true : false; 
// Extract the ch mask 
if (bChMaskEnabled) 
{ 

m_nChMask = ExtractBits (nBits4ChMask) ; 

} 

if (! bChMaskEnabled) { 

for (ch = 0; ch < nChSetLLChannel ; ch++) { 
RadiusDelta [ch] = ExtractBits (9) ; 
Theta[ch] = ExtractBits ( 9 ) ; 
Phi [ch] = ExtractBits (7) ; 



else{ // Case when bOne20neMapChannels2Speakers=f alse . No downmixing is 
// allowed and each channel set is the primary channel set 
bPrimaryChSet = true; 

m_bDownmixCoeff CodeEmbedded = false; 
m_bHierChSet = true; 
m_nNumChPrevHierChSet = ; 
m_nNumDwnMixCodeCoef f s = 0; 
bMappingCoef f sPresent = ExtractBits ( 1 ) ; 
if (bMappingCoeff sPresent) { 

m_nBitsCh2SpkrCoef = ExtractBits ( 3 ) ; 

// Map 0->6, l->8 , ... 7->20 

m_nBitsCh2SpkrCoef =6+2*m_nBitsCh2SpkrCoef 
m_nNumSpeakerConf igs = ExtractBits (2 ) + 1; 
nCof lnd=0 ; 

for (nSpkrConf =0; nSpkrConf <nNumSpeakerConf igs ; nSpkrConf ++) { 
m_pnActiveChannelMask [nSpkrConf ] = ExtractBits (nChSetLLChannel ) ; 
m_pnNumSpeakers [nSpkrConf ] = ExtractBits ( 6 ) + 1; 
bSpkrMaskEnabled = (ExtractBits (1) ==1) ? true: false; 
// Extract the mask 
if (bSpkrMaskEnabled) 

m_nSpkrMask [nSpkrConf ] =ExtractBits (nBits4ChMask ) ; 
for (nSpkr=0; nSpkr<m_pnNumSpeakers [nSpkrConf ] ; nSpkr++) { 
// Extract speaker locations in polar coordinates 
if (! bSpkrMaskEnabled) { 

RadiusDelta [nSpkrConf ] [nCh] = ExtractBits ( 9 ) ; 
Theta [nSpkrConf ] [nCh] = ExtractBits ( 9 ) ; 
Phi [nSpkrConf ] [nCh] = ExtractBits ( 7 ) ; 

} 

// Extract channel to speaker mapping 
// coefficients for each active channel 
for (nCh=0; nCh< nChSetLLChannel; nCh++) { 
if (m_nActiveChannelMask [nSpkrConf ] & (1<< nCh) ) 
m_pnCh2SpkrMapCoef f [nCof Ind++] = 
ExtractBits (m_nBitsCh2SpkrCoef ) ; 



1 

nDownmixCoeffsx9 
1 

nBits4ChMask 



25 per Ch 



3 
2 



nChSetLLChannel 
6 
1 

nBits4ChMask 



25 per speaker 



m_nBitsCh2SpkrCoef 
per active channel 



// End speaker loop indexed by nSpkr 
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Syntax 


Size (Bits) 


} // End speaker configuration loop indexed by nSpkrConf 
} // if (bMappingCoef f sPresent ) 
} // End of bOne20neMapChannels2Speakers=f alse case 
// Extract the num of frequency bands 
if (m nFs > 96000) { 

bXtraFreqBands = ExtractBits ( 1 ) ; 
if (bXtraFreqBands == 1) 

m nNumFreqBands = 4 ; 
else 

m nNumFreqBands = 2 ; 

} 


1 


else 

m nNumFreqBands = 1 ; 
// Unpack the pairwise channel decorrelation enable flag 
/ / for Band 


1 


nBand = ; 

bPWChDecorEnabled [nBand] = ExtractBits ( 1 ) ; 
// Unpack the original channel order 

for (ncBits4ChOrder =0, n = 1; n < m nChSetLLChannel ; ncBits4ChOrder++ , n *= 
2) { 

NULL ; 




} 

if (bPWChDecorEnabled [nBand] == 1) 


Calculated 


{ 

// Unpack the original channel order 

for (nCh = 0; nCh < m nChSetLLChannel; nCh++) 

m nOrigChanOrder [nBand] [nCh] = ExtractBits (ncBits4Ch0rder ) ; 




} 

// The code for bChPFlag is embedded within nPWChPairsCoef f s below 
// For nBand unpack the channel pairwise flags and coefficients 
if (bPWChDecorEnabled [nBand] == 1) 

for (nCh = 0; nCh < (m nChSetLLChannel >> 1); nCh++) { 
bChPFlag [nBand] = ExtractBits ( 1 ) ; 
if (bChPFlag [nBand] == 1) { 

nTmp = ExtractBits ( 7 ) ; // Unpack as unsigned 
// Map to signed 

m anPWChPairsCoef f s [nBand] [nCh] = 
(Tmp & 0x1) ? -(nTmp >> 1) - 1 : nTmp >> 1; 

} 

else 

// Zero out coefficients when channel decorrelation is disabled 
m anPWChPairsCoeffs [nBand] [nCh] = 0; 


1 

7 per Ch pair 
(Conditional) 


} 

// Unpack the optimal adaptive predictor order; 

// one per channel on the frame basis, indicates no prediction 

m nCurrHighestLPCOrder [nBand] = 0; 

for (nCh = 0; nCh < m nChSetLLChannel; nCh++) 




{ 

m pnAdaptPredOrder [nBand] [nCh] = ExtractBits (4 ) ; 
m nCurrHighestLPCOrder [nBand] = 

(m pnAdaptPredOrder [nBand] [nCh] > 

m nCurrHighestLPCOrder [nBand] ) ? m pnAdaptPredOrder [nBand] [nCh] : 
m nCurrHighestLPCOrder [nBand] ; 


4 per Ch 


} 

// Unpack the optimal fixed predictor order per channel 
// on the frame basis 

// Only for channels where m pnAdaptPredOrder [0] [nCh] =0 
// indicates no prediction 

for (nCh = 0; nCh < m nChSetLLChannel; nCh++) 
{ 

if (m_pnAdaptPredOrder [nCh] == 0) 

m pnFixedPredOrder [0] [nCh] = ExtractBits (2 ) ; 
else 

m_pnFixedPredOrder [0] [nCh] = 0; 

} 


2 per Ch 
(Conditional) 


// Unpack adaptive predictor quantized 

// reflection coefficients 

pnTemp = m pnLPCRef ICoef f sQInd [nBand] ; 

for (nCh = 0; nCh < m nChSetLLChannel; nCh++) { 

for (n = 0; n < m pnAdaptPredOrder [nBand] [nCh] ; n++) { 
nTmp = ExtractBits ( 8 ) ; // Unpack as unsigned 

// Map to signed 
*pnTemp++ = (nTmp & 0x1) ? -(nTmp >> 1) - 1 : nTmp >> 1; 

} 

} 


8 bits per coefficient 
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Syntax 



Size (Bits) 



if (bScalableLSBs) 

nLSBFsize [0] = ExtractBits (nBits4SSize ) ; 
if <m_nBitWidth > 16) 

m_nBits4ABIT = 5 ; 
else if (m_nBitWidth > 8) 

m_nBits4ABIT = 4 ; 
else 

m_nBits4ABIT =3; // for m_nBits4ABIT=5 case it has been taken care by 
// LSB/MSB spliting 
if (nNumChSetsInFrame > 1 && m_nBits4ABIT < 5) 

m_nBits4ABIT += 1; // to take care of ES saturation 
if (bScalableLSBs) { 

for (nCh = 0; nCh < m_nChSetLLChannel ; nCh++) 
m_pnScalableLSBs [0] [nCh] = ExtractBits (4 ) ; 

} 

else { 

for (nCh = 0; nCh < m_nChSetLLChannel ; nCh++) 
m_pnScalableLSBs [0] [nCh] = 0; 

} 

if (bScalableLSBs) { 

for (nCh = 0; nCh < m_nChSetLLChannel ; nCh++) 
m_pnBitWidthAdj PerCh [0] [nCh] = ExtractBits (4 ) ; 

} 

else { 

for (nCh = 0; nCh < m_nChSetLLChannel ; nCh++) 
m_ pnBitWidthAdjPerCh [0] [nCh] = ; 

} 

// Extract extra frequency band parameters 
for (nBand = 1; nBand < m_nNumFreqBands ; nBand++) { 
// Unpack the pairwise channel decorrelation enable 

// flag for extension bands 
bPWChDecorEnabled [nBand] = ExtractBits ( 1 ) ; 
// Unpack the original channel order 
if (bPWChDecorEnabled [nBand] == 1) { 

for (nCh = 0; nCh < m_nChSetLLChannel ; nCh++) 

m_nOrigChanOrder [nBand] [nCh] = ExtractBits (ncBits4ChOrder) ; 

} 

// The code for bChPFlag is embedded within 
// nPWChPairsCoef f s below 

// For an extension band unpack the channel pairwise 
// flags and if true the corresponding coefficients 
if (bPWChDecorEnabled [nBand] == 1) { 

for (nCh = 0; nCh < (m_nChSetLLChannel >> 1); nCh++) { 
bChPFlag [nBand] = ExtractBits ( 1 ) ; 
if (bChPFlag [nBand] == 1) { 

nTmp = ExtractBits (7) ; // Unpack as unsigned 
m_anPWChPairsCoef f s [nBand] [nCh] = 
(nTmp & 0x1) ? -(nTmp >> 1) - 1 : nTmp >> 1; // Map to signed 

} 

else 

// Zero out coefficients when channel decorrelation is disabled 
m anPWChPairsCoeffs [nBand] [nCh] = ; 



// Unpack the optimal adaptive predictor order; 

// one per channel on the frame basis 

// indicates no prediction 

m_nCurrHighestLPCOrder [nBand] = ; 

for (nCh = 0; nCh < m_nChSetLLChannel ; nCh++) { 

m_pnAdaptPredOrder [nBand] [nCh] = ExtractBits (4 ) ; 
m_nCurrHighestLPCOrder [nBand] = 
(m_pnAdaptPredOrder [nBand] [nCh] > m_nCurrHighestLPCOrder [nBand] 
m_pnAdaptPredOrder [nBand] [nCh] : m_nCurrHighestLPCOrder [nBand] ; 

} 

// Unpack the optimal fixed predictor order per 
// channel on the frame basis 

// Only for channels where m_pnAdaptPredOrder [nCh] =0 
// indicates no prediction 

for (nCh = 0; nCh < m_nChSetLLChannel ; nCh++) { 
if (m_pnAdaptPredOrder [nBand] [nCh] == 0) 

m_pnFixedPredOrder [nBand] [nCh] = ExtractBits (2 ) ; 
else 

m_pnFixedPredOrder [nBand] [nCh] = ; 

} 

// Unpack adaptive predictor quantized 

// reflection coefficients 

pnTemp = m_pnLPCRef ICoef f sQInd [nBand] ; 



nBits4Ssize 
(Conditional) 



4 per Ch 
(Conditional) 



4 per Ch (Conditional) 



Calculated 



1 per Ch pair 



7 per Ch pair 
(Conditional) 



4 per Ch 



2 per Ch 
(Conditional) 



pnAdaptPredOrder[ch] 
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Syntax 



Size (Bits) 



for (nCh = 0; nCh < m_nChSetLLChannel ; nCh++) { 

for (n=0; n<m_pnAdaptPredOrder [nBand] [nCh] ; n++) { 
nTmp = ExtractBits (8) ; // Unpack as unsigned 
*pnTemp++ = (nTmp & 0x1) ? - (nTmp >> 1) - 1 : nTmp 

} 

} 

if (m_bDownmixEmbedded) 

m_bEmbDMixInExtBand [nBand] = ExtractBits ( 1 ) ; 
m_bMSBLSBSplitEnInExtBands [nBand] = ExtractBits ( 1 ) ; 
if (bMSBLSBSplitEnlnExtBands [nBand] ) 

nLSBFsize [nBand] = ExtractBits (nBits4SSize ) ; 
else 

nLSBFsize [nBand] = ; 
if (bMSBLSBSplitEnlnExtBands [nBand] ) { 

for (nCh = 0; nCh < m_nChSetLLChannel ; nCh++) 
m_pnScalableLSBs [nBand] [nCh] = ExtractBits (4 ) ; 

} 

else { 

for (nCh = 0; nCh < m_nChSetLLChannel ; nCh++) 
m_pnScalableLSBs [nBand] [nCh] = 0; 

} 

m_bFlagScalableResExtBand [nBand] = ExtractBits ( 1 ) ; 
if (m_bFlagScalableResExtBand [nBand] ) { 

for (nCh = 0; nCh < m_nChSetLLChannel ; nCh++) 

m_pnBitWidthAdjPerCh [nBand] [nCh] = ExtractBits (4 ) ; 

} 

else{ 

for (nCh = 0; nCh < m_nChSetLLChannel ; nCh++) 
m_pnBitWidthAdj PerCh [nBand] [nCh] = 0; 

} 

} // End Extra frequency band loop 
Reserved = ExtractBits (...) ; 
ByteAlign = ExtractBits ( ... 7) ; 
nCRC16SubHeader = ExtractBits ( 16 ) ; 



per Ch 



1 (Conditional) 
1 (Conditional) 

nBits4Ssize 
(Conditional) 



4 per Ch 
(Conditional) 



4 per Ch (Conditional) 



0...7 
16 



nChSetHeaderSize (Size of Channel Set Sub-header) 

nChSetHeaderSize+1 is the size of the channel set sub-header in bytes. Use this to locate the sub-header of the next 
channel set. 

nChSetLLChannel (Number of Channels in Set) 

This indicates the number of channels in the channel set. 
nResidualChEncode (Residual Type) 

This is an array of bits (one bit per encoded channel) indicating the nature of the residual signal that is encoded in each 
of the encoded channels. For a particular encoded channel the value of corresponding bit in nResidualChEncode equal 
to '0' indicates that the residual in that channel is obtained by subtracting the lossy decoded audio from the original 
audio. For a particular encoded channel the value of corresponding bit in nResidualChEncode equal to '1' indicates that 
the residual in that channel is the original audio. Bits in the array nResidualChEncode are ordered according to the 
channel numbers where the highest channel number corresponds to the most significant bit. 

nBitResolution (PCM Bit Resolution) 

This expresses the original resolution of the PCM audio source as output by an A/D converter. 
nBitWidth (Storage Unit Width) 

This specifies the bit width of the storage media. For example, the PCM audio with nBitResolution=20 would be stored 
in wav files with the nBitWidth equal to 24. In the lossless decoder, in order to recreate the wav file that matches the 
original wav file, the decoded samples shall be shifted to the right by the amount equal to nBitWidth - nBitResolution. 

sFreqlndex (original sampling frequency) 

This is the sample rate of audio in a channel set. It corresponds to the resulting sampling frequency after interpolation 
by the nFsInterpolate factor, on the encode side. The 4-bit sFreqlndex field is interpreted as described in Table 8-5. 
Remember that certain extra fields are encountered in the data stream once the sample rate is greater than 96 kHz. 
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Table 8-5: sFreqlndex Sample Rate Decoding 
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10 


12 000 


11 


24 000 


12 


48 000 


13 


96 000 


14 


192 000 


15 


384 000 



nFsInterpolate (sampling frequency modifier) 

This specifies the sampling rate interpolation factor of the original signal. The sampling rate adjustment when mixing 
channel sets with different sampling rates, it specifies the interpolation factor applied to the original sampling 
frequency. This 2-bit code (see Table 8-6) encodes the Mvalue or Interpolation factor applied to the original sampling 
frequency. 



Table 8-6: Sampling Rate Interpolation 



nFsInterpolate 


Mvalue 


00 


1 


01 


2 


10 


4 


11 


8 



NReplacementSet 

This is an indicator of which replacement set this set is a member of, (see Table 8-7). There are a maximum of 3 
replacement sets. 



Table 8-7: Replacement Set Association 



nRepiacementSet 


Association 


00 


Not a replacement 


01 


Member of set 1 


10 


L Member of set 2 


11 


Member of set 3 



bActiveReplaceSet (Default replacement set) 

This is present only if nRepiacementSet != 0. In particular, bActiveReplaceSet =true indicates that the current channel 
set is the active channel set. 

bPrimaryChSet 

This is present only if bOne20neMapChannels2Speakers is true. The bPrimaryChSet=true indicates that the set is a 
primary channel set. The primary channel sets represent the subset of all encoded channels, i.e. it contains data for 5.1 
audio or a stereo downmix. 
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bDownmixCoeffCodeEmbedded (Downmix coefficients present in stream) 

This is present only if bOne20neMapChannels2Speakers is true. It indicates whether a matrix of downmix coefficients 
has been defined and is embedded in the stream. 

bDownmixEmbedded (Downmix already performed by encoder) 

This is present only if bOne20neMapChannels2Speakers is true. If bDownmixCoeffEmbedded is set, then extract 
bDownmixEmbedded. When bDownmixEmbedded=true this indicates to the decoder that on the encode side, audio in 
the channels of the current channel set has been downmixed to channels in the lower channels sets indexed. After 
decoding the current channel set, the above mentioned encoder downmix operation needs to be undone in the decoder. 
If bDownmixEmbedded=false, the encoder did not perform the downmixing operation on the current channel. 

nLLDownmixType 

This is present only if bOne20neMapChannels2Speakers is true. It indicates the downmix processing type for the 
primary ChSet group. The downmix action is defined in Table 8-8. 



Table 8-8: Downmix Type 



nLLDownmixType 


Downmix primary ChSet group to 


000 


1/0 


001 


Lo/Ro 


010 


Lt/Rt 


011 


3/0 


100 


2/1 


101 


2/2 


110 


3/1 


111 


Unused 



BHierChSet 

This is present only if bOne20neMapChannels2Speakers is true. It indicates whether the channel set is part of a 
hierarchy. 

DownmixCoeffs 

This is present only if bOne20neMapChannels2Speakers is true. If bDownmixCoefCodeEmbedded SetDownmix = 
true, then extract nDownmixCoeffs. To extract the coefficients, the decoder needs to determine the number of downmix 
coefficients (NDownmixCoeffs) by calculating the size of the NxM table of coefficients. N is defined as 
nChSetLLChannel+1 rows, where the extra row represents the down scaling coefficients that prevent overflow. 
Conversely, the number of columns M (the number of channels that the current channel set is mixed into) is determined 
when the decoder decodes the channel set hierarchy down to the primary channel set. 

Consider the case where there are 2 extended channel sets and 1 primary channel set (a total of 3 channel sets). When 
the decoder is currently unpacking the downmix coefficients defined in the second extended channel set (4 channels), N 
is 4+1=5. To determine M, the decoder shall traverse the channel set hierarchy and count the number of channels within 
each channel set. Hence, there are 6 channels in the primary set and 2 channels in the first extended channel set. 
Therefore M=8 and thus 5*8 coefficients. 

When downmix coefficients are defined for the primary channel group, use the nLLDownmixType to determine M 
(number of resultant fold down channels) and the total number of channels in the primary channel group to determine 
N. 

Coding of the downmix coefficients is described in clause C.8. 
bChMaskEnabled (Channel Mask Enabled) 

This is present only if bOne20neMapChannels2Speakers is true. When set, it indicates that the channel set is using a 
predefined channel mask. Otherwise a user specified polar coordinate speaker configuration is applied. 

nChMask (Channel Mask for Set) 

This is present only if bOne20neMapChannels2Speakers is true. When bChMaskEnabled is set, this field indicates 
which pre-defined channel positions apply. (See clause C.7 for details). 
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ChSetSpeakerConfiguration (Angular Speaker Position Table) 

This is present only if bOne20neMapChannels2Speakers is true. If bChMaskEnabled==false, extract the 
speaker/channel configuration for the channel set. Each speaker (or channel) location is expressed in spherical 
coordinates (delta, theta, phi), where the origin is the location of the listener. The centre of each speaker is ideally 
located on the surface of the sphere surrounding the listener. Differences in speaker radii are stored as deltas from this 
imaginary spherical surface. The radius has a range [-510, 510] with resolution of 2 cm (9 bits). The angle theta is 
expressed as [-180, 180] with a resolution of 1 degree (9 bits). The angle phi is [-90, 90] with a resolution of 2 degrees 
(7 bits). The speakers each correspond to channels defined in the previous channel reference. After all objects have been 
extracted, the master speaker configuration can be created. The first speaker in the channel set defines the spherical 
surface and all other speakers have radii expressed as deltas relative to this first channel. 

bMappingCoeffsPresent (Mapping Coefficient Present Flag) 

The bMappingCoeffsPresent is present only if bOne20neMapChannels2Speakers is false. When 
bMappingCoeffsPresent is true, it indicates that channel-to-speaker mapping coefficients are present in the stream. 

When the bOne20neMapChannels2Speakers is false, the encoded channels carry the signals that describe the sound 
field but are not necessarily the actual loudspeaker feeds. The actual loudspeaker feeds are derived on the decode side 
using the channel-to-speaker mapping coefficients and possibly user-provided adjustment factors. For certain types of 
representation, the channel-to-speaker mapping coefficients are not needed (i.e. L h /R h ) or channel-to-speaker mapping 
coefficients are provided by some other metadata fields (like mixing data for audio assets). In those cases, the parameter 
bMappingCoeffsPresent may be set to false. Consequently no channel-to-speaker mapping coefficients and associated 
bit fields will be present in the stream. 

nBitsCh2SpkrCoef 

This is present only if bOne20neMapChannels2Speakers is false and bMappingCoeffsPresent is true. This parameter 
represents a code for a number of bits used to pack each channel-to-speaker mapping coefficient. The actual number of 
bits is obtained from mapping: 

w-_MS^£h2JsfcfCo?/ = *f 2 - m_nR!ts£h2fykrCa?f 

nNumSpeakerConfigs (Number of Loudspeaker Configurations) 

This is present only if bOne20neMapChannels2Speakers is false and bMappingCoeffsPresent is true. Generally, the 
signals that describe the sound field- and are transmitted in a stream as channels-may be used to create speaker feeds for 
an arbitrary 3D speaker configuration. In some cases there exist exact mathematical relationships between the 
coefficients for different loudspeaker configurations. However, the content creators may wish to perform small 
adjustments to these relationships. In that case, it is beneficial to transmit separate sets of channel-to-speaker mapping 
coefficients for different speaker configurations (e.g. one set for 5.1 and one set for 7.1). The parameter 
nNumSpeakerConfigs indicates this number of different speaker configurations and consequently, the number of sets of 
channel-to-speaker mapping coefficients. 

pnActiveChannelMask (Active channel mask for current loudspeaker configuration) 

This is present only if bOne20neMapChannels2Speakers is false and bMappingCoeffsPresent is true. This bit mask 
indicates the activity of each channel in the particular speaker configuration mapping. The least significant bit 
corresponds to channel and the most significant bit corresponds to channel nChSetLLChannel -1. For some speaker 
configurations, not all of the encoded channel signals are used in channel-to-speaker mappings. For example, in 
Ambisonic B-format there are four signals (W, X, Y and Z) that would be coded as four channels. The signal Z, in 
particular, carries information about the sound field component along the z-axis in Cartesian coordinates. Therefore, 
mapping to a standard 5 . 1 loudspeaker layout would not involve the Z channel. Consequently the mask would be 1 1 1 . 

pnNumSpeakers 

This is present only if bOne20neMapChannels2Speakers is false and bMappingCoeffsPresent is true. This parameter 
represents a number of speakers in a current loudspeaker configuration. 

bSpkrMaskEnabled 

This is present only if bOne20neMapChannels2Speakers is false and bMappingCoeffsPresent is true. 
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Each of the nNumSpeakerConfigs speaker configurations, for which the channel-to-speaker mapping coefficients are 
defined, may be described using either: 

• the predefined loudspeaker mask (bSpkrMaskEnabled is true); or 

• the speaker positions in polar coordinates (bSpkrMaskEnabled is false). 
nSpkrMask (Speaker mask for current loudspeaker configuration) 

This is present only if bOne20neMapChannels2Speakers is false, bMappingCoeffsPresent is true and the 
bSpkrMaskEnabled is true. It indicates which pre-defined loudspeaker positions define the current speaker 
configuration. See clause C.7 for details. One of the available configurations shall be a standard 5.1 layout (C, L, R, Ls, 
Rs, LFE1). 

ChSetSpeakerConfiguration (Angular speaker position table for current loudspeaker configuration) 

This is present only if bOne20neMapChannels2Speakers is false and bMappingCoeffsPresent is true. If 
bSpkrMaskEnabled is false, extract the speaker position for the current loudspeaker configuration. Coding of 
loudspeaker location parameters is described in clause C.3. 

pnCh2SpkrMapCoeff (Channel to loudspeaker mapping coefficients) 

This is present only if bOne20neMapChannels2Speakers is false and bMappingCoeffsPresent is true. 

The pnCh2SpkrMapCoeff represents the set of channel-to-speaker mapping coefficients, one for each speaker of each 
loudspeaker configuration and for each active channel. Coefficients are represented as signed numbers in 
Q3.(m_nBitsCh2SpkrCoef-4) fixed point format, i.e. the actual coefficients in range from -8 to 8 are related to the 

P nCh2SpkrM apCoeff[n] 

transmitted coefficient pnCh2SpkrMapCoeff [n] as 2< m - nBitsCh 2SpkrCoef 4) . The mapping equation for the 
loudspeaker S k and active channels Chj j=0,l, M is as follows: 

j=M 

Sk = J] Coe ffkj-Chj 
j=o 

where the Coeff k - denotes the coefficient corresponding to the mapping from j-th active channel to the fc-th loudspeaker 
in the current loudspeaker configuration. 

bXtraFreqBands (Indicates extra frequency bands when the sample rate is greater than 96 kHz) 

If the sample frequency (m_nFs) is greater than 96 kHz, then this boolean flag exists and it indicates the number of 
extra frequency bands present in the stream. 

A value of T indicates that full bandwidth is preserved. In particular, the number of frequency bands that are present in 
the stream (m_nNumFreqBands) is: 

12 for 96 kHz < m_nFs< 192 kHz 
m nNumFrecBands=^ 

|4 for 192 kHz<m_nFs<384 kHz 

A value of '0' indicates that only one-half of the original bandwidth is preserved. In this case: 

fl for 96 kHz < m_nFs < 192 kHz 
m nNumFrecBands=^ 

|2 forl92 kHz<m_nFs<384 kHz 

For sample frequencies below or equal to 96 kHz, this field is not present and the number of frequency bands is one by 
default. 

bPWChDecorEnabled[0] (Pairwise Channel Decorrelation for frequency band 0) 

When this is set, one or more channel pairs have been processed with pairwise channel decorrelation. This type of 
processing is performed separately for each frequency extension band. 
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nOrigChanOrder[0] (Original channel order for frequency band 0) 

If bPWChDecorEnabled[0] is set, unpack the original channel order for each channel in the channel set. If 
bPWChDecorEnabled[0] is set (at the encoder), the input channels have been grouped into pairs for the channel 
decorrelation process. In this case the original channel order is included in the bit stream. After lossless residual 
decoding, the original channel order shall be restored before either combining with the lossy output or outputting the 
residual itself when no core is present. The width of this field is calculated based on examining the first bit set size of 
nChSetLLChannel. 

bChPFlag[0] (Channel pairwise flags in frequency band 0) 

If bPWChDecorEnabled[0] is set, unpack the channel pairwise flags for each channel in the set. If bChPFlag is set, then 
a coefficient is included in the stream. 

nPWChPairsCoeffs[0] (Pairwise Channel Coefficients for frequency band 0) 

If bChPFlag for channel is set, unpack the pairwise decorrelation coefficients for that channel. This coefficient should 
be converted to a signed number and used to scale the source channel before adding the scaled version to the destination 
channel. 

pnAdaptPredOrder[0] (Adaptive predictor order in frequency band for each channel) 

This is an adaptive predictor order per channel. A zero indicates no prediction. 

pnFixedPredOrder[0] (Fixed predictor order for frequency band 0) 

This is a fixed predictor order per channel in frequency band 0. A zero indicates no prediction. 

pnLPCReflCoeffsQIndfO] (Adaptive predictor quantized reflection coefficients in frequency band 0) 

This is present only if the adaptive prediction order is not zero. 

nLSBFsize[0] (Size of the LSB section in any segment of frequency band 0) 

When bScalableLSBs is set, the size of the LSB data, which is the same in all segments of frequency band 0, will be 
present in the stream. Since the size of the entire channel set in a particular segment of frequency band is already 
defined in the NAVI table, the difference will indicate the size of the MSB section for that segment. 

pnScalableLSBs[0] (Number of bits used to represent the samples in LSB part of frequency band 0; one per 
channel) 

If bScalableLSBs is set, extract the pnScalableLSBs[0][nCh] for each channel nCh in a channel set. This is the number 
of bits used for representing the samples in LSB part of frequency band in channel nCh. 

pnBitWidthAdjPerCh[0] (Number of bits discarded by authoring in frequency band 0) 

This is the number of bits that an authoring tool discarded from the frequency band for bit rate management. If 
bScalableLSBs is set, extract pnBitWidthAdjPerCh for each channel in the channel set. Normally, in the encoded 
master, this field has value 0. After bit rate management, this value is used to pad out the decoded audio to ensure the 
number formats of all decoded audio are alike before downmix reversal. 

bPWChDecorEnabledfnBand] (Pairwise Channel Decorrelation for frequency extension band nBand) 

This is present only when m_nNumFreqBands>l. When set for a particular frequency extension band, one or more 
channel pairs have been processed with pairwise channel decorrelation. This type of processing is performed separately 
for each frequency extension band. 

nOrigChanOrder[nBand] (Original channel order for extension frequency band) 

If bPWChDecorEnabled [nBand] is set, unpack the original channel order for each channel in the channel set. If 
bPWChDecorEnabled[nBand] is set (at the encoder), the input channels have been grouped into pairs for the channel 
decorrelation process. In this case the original channel order is included in the bit stream. After lossless residual 
decoding, the original channel order shall be restored before either combining with the lossy output or outputting the 
residual itself when no core is present. The width of this field is calculated based on examining the first bit set size of 
nChSetLLChannel. 
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bChPFlag[nBand] (Channel pairwise flags in frequency extension band nBand) 

When the m_nNumFreqBands>l and when the bPWChDecorEnabled[nBand] is set, unpack the channel pairwise flags 
for each channel in the set. 

nPWChPairsCoeffs[nBand] (Pairwise Channel Coefficients for frequency extension band nBand) 

When the m_nNumFreqBands>l and when bChPFlag[nBand[nCh] for channel nCh is set, unpack the pairwise 
decorrelation coefficients for that channel. This coefficient should be converted to a signed number and used to scale 
the source channel before adding the scaled version to the destination channel. 

pnAdaptPredOrder[nBand] (Adaptive predictor order for extra frequency band nBand) 

This is present only when m_nNumFreqBands>l. It indicates the adaptive predictor order per channel. A zero indicates 
no prediction. 

pnFixedPredOrderfnBand] (Fixed predictor order for extra frequency band nBand) 

This is present only when m_nNumFreqBands>l and the pnAdaptPredOrder[nBand][nCh]=0. It indicates the fixed 
predictor order per channel nCh in extra frequency band nBand. A zero indicates no prediction. 

pnLPCReflCoeffsQIndfnBand] (Adaptive predictor quantized reflection coefficients in extra frequency band 
nBand) 

This is present only if the adaptive prediction order is not zero. 

bEmbDMixInExtBand[nBand] (MSB/LSB split flag in extension frequency band) 

This is present only when the m_nNumFreqBands>l and the bDownmixEmbedded is true. When the 
bEmbDMixInExtBand[nBand] is true, it indicates that the embedded downmix has been performed in frequency band 
nBand. If bEmbDMixInExtBand is false for all extension frequency bands, then the downmix is embedded only in 
frequency band and the samples in extension frequency bands are scaled accordingly. 

bMSBLSBSplitEnInExtBands[nBand] (MSB/LSB split flag in extension frequency band) 

This is present only when the m_nNumFreqBands>l and it indicates whether the MSB/LSB split has been performed in 
frequency band nBand. 

nLSBFsizefnBand] (Size of the LSB section in extension frequency band nBand) 

This is present only when the bMSBLSBSplitEnInExtBands[nBand] is true and it indicates the size of the LSB data, 
which is the same in all segments of one frequency band. Since the size of the entire channel set in a particular segment 
of a frequency band (indexed by nBand) is already defined in the NAVI table, the difference will indicate the size of the 
MSB section in a particular segment of the nBand-th frequency band. 

pnScalableLSBsfnBand] (Number of bits used to represent the samples in LSB part of frequency band nBand; 
one per channel) 

If bMSBLSBSplitEnInExtBands[nBand] is true, extract the pnScalableLSBs[nBand][nCh] for each channel nCh in a 
channel set. This is the number of bits used for representing the samples in the LSB part of frequency band nBand in 
channel nCh. 

bFlagScalableResExtBandfnBand] (Scalable Resolution in Extension Band Enable Flag) 

When the m_nNumFreqBands>l and when the bFlagScalableResExtBand is set, the reduction of bit width of extension 
band data may be performed during the authoring process in order to reduce the bit rate requirements. 

pnBitWidthAdjPerChfnBand] (Number of bits discarded by authoring in extra frequency band nBand) 

This field is present only in the case when m_nNumFreqBands>l and bFlagScalableResExtBandfnBand] is true. This is 
the count of the number of bits that an authoring tool discarded for bit-rate management from the extra frequency band 
nBand. Notice that unlike the case of frequency band 0, where the bit width reduction is only allowed in the LSB part of 
the data, the bit width reduction of extension frequency band data may occur in LSB as well as the MSB parts. 
Consequently, in extension frequency bands the bit-rate management may be performed even when the MSB/LSB split 
is not enabled (bFlagScalableResExtBandfnBand] is false). 
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For each extra frequency band (nBand), read pnBitWidthAdjPerCh[nBand] for each channel in the channel set. 
Normally, in the encoded master this field has value 0. After bit-rate management, this value is used to pad out the 
decoded audio to ensure the number formats of all decoded frequency bands are alike before final filter bank 
interpolation. 

Reserved (Reserved) 

This is reserved for supplemental channel set header information. The decoder shall assume that this field is present and 
of unspecified duration. Therefore in order to continue unpacking, the stream decoder shall skip over this field using the 
channel set header start pointer and the channel set header size nChSetHeaderSize. 

nCRC16SubHeader (CRC16 of channel set sub-header) 

This is the CRC16 of the entire channel set header from positions nChSetHeaderFSize to Byte Align inclusive. 

8.2.3 Navigation Index Table 

The navigation index table, or NAVI, is comprised of the sizes of individual frequency bands. It is described in more 
detail in clause 8.3.1. 

8.2.4 Frequency Bands 

Table 8-9 details the composition of the frequency bands. 



Table 8-9: Frequency Bands 



Syntax 


Size (Bits) 


FreqBandData = ExtractBits (var) ; 


Var 


ByteAlign = ExtractBits { ... 7) ; 


0...7 



FreqBandData (Frequency Band Data) 

This contains the frequency band data. 

8.3 Lossless Stream Synchronization & Navigation 

Intrinsic features of the DTSXLL stream format are its synchronization, navigation, error handling and recovery 
abilities. The following issues are addressed: 

• Navigation Index 

• Stream Navigation 

• Error Detection 

• Error Resilience 

8.3.1 Navigation Index 

The navigation index (NAVI) depicted in Figure 8-4 contains the size of each of the individual components contained in 
the lossless stream. The number of entries in the NAVI table is determined by the number of frequency bands, the 
number of segments and the number of channel sets. Also note that NOT all channel sets will have components for all 
frequency bands. As a result, the number of frequency bands designated for each channel set sub-header should be used 
in the calculation of the overall number of entries. Additionally the nBits4Ssize field globally determines the bit width 
of each individual entry in the index. 
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Figure 8-4: Navigation Index 



Since the NAVI data is essential to reliable decoding of the frame and error recovery, the data is protected by a CRC16 
checksum. It is easy to see that random access to any point in the stream can be achieved by the summation of prior 
entries in the index. For example, pseudo-code to illustrate the index packing order is given below. 

M = numberof segments ; 

N = numberof channelsets ; 

P = number_of _f requency_bands ; 

for (Band=0; Band<P; Band++) { 
BandSize [Band] =0 ; 
for (Seg=0;Seg<M;Seg++) { 

SegmentSize [Band] [Seg] =ExtractBits (nBits4Ssize) ; 
for <ChSET=0 ; ChSET<N; ChSET++){ 

if { GetNumFreqBand4ChSet [ChSET] > Band ) { 

BandChSetSize [Band] [Seg] [ChSET] =ExtractBits (nBits4Ssize) +1 ; 
SegmentSize [Band] [Seg] += BandChSetSize [Band] [Seg] [ChSET] ; 
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} 

BandSize [Band] += SegmentSize [Band] [Seg] ; 

} 

} 

// Re-align bit pointer to byte position 
bitptr = next_byte_position_in_stream ( ) ; 
Checksum = ExtractBits ( 16 ) ; 



The NAVI CRC16 is computed for all data expressed in the listing above, inclusive of any byte alignment fields. When 
byte alignment is applied, the value inserted in the stream shall be zero. 

8.3.2 Stream Navigation 

Navigation to the checksums in both the header and channel set sub header is achieved by reading the size and 
subtracting the width of the checksum field. After offsetting the pointer from the present position, the checksum field is 
then immediately found. 

Navigation to the start position of the first segment in the frame is obtained by summing the size of the header, the size 
of each channel set sub-header and the computed size of the NAVI table. 

At the segment level, all channel sets are packed in numerically-increasing sequential order. For instance, the first 
channel set in the segment is the primary channel set (5.1) and the second set in the segment contains the extended 
channel (ES) set. 

Navigation from one frequency band to the next sequential frequency band is achieved by computing the sum of all 
bands (or all segments) within the current frequency band and advancing by this offset to the next frequency band. 

Navigation from one segment of a frequency band to the next sequential segment in the same frequency band is 
achieved by computing the sum of all bands for all channel sets in the present segment set and advancing by this offset 
to the next segment. 

Navigation from one packed channel set to the next sequential channel set in the same frequency band and segment is 
achieved by using the band size of the current channel set and advancing by this offset to the next channel set. 

Navigation within a band to the start of an LSB band data is achieved by subtracting the LSB band size located in the 
channel set sub-header from the frequency band size indicated in the navigation index. 

8.3.3 Error Detection 

Checksums are the principle mode for verifying that data is intact. Error detection is assisted by the presence of CRC 
fields at key points within the lossless data stream. The CRC fields validate the data just read, but they also play a 
significant role in verifying that the stream synchronization is correct. The checksum of the headers is handled as 
described below. Note that when a checksum is included in the validation check, the result is zero. 

The common headers, channel set sub-header and NAVI table are protected by performing a CRC 16 checksum 
generation across all the header data, inclusive of byte alignment. When the checksum is included as part of the 
evaluation range, the output from the CRC 16 detection algorithm will be zero if no fault condition is present. Any non- 
zero value indicates a fault. 

The frequency bands can be checked for errors in two ways: 

• First, by ensuring that-after unpacking-the bit position travelled expressed as a whole number of bytes 
correlates with the band size extracted from NAVI; and/or 

• Second, by computing a CRC 16 for the entire frequency band data and comparing with an optional stored 
CRC16 checksum. This checksum when flagged by nBandDataCRCEn could be positioned in the frequency 
band data at one of three locations: 

1) theendofMSBO; 

2) the end of MSBO and at end of LSBO; or 

3) the end of MSBO and at end of LSBO and at the end of bands 1 , 2 and 3, where they exist. 
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Each frequency band terminates on a byte boundary and is zero padded until this condition is met. The optional 
checksum (CRC16) may then be placed at this position. The value stored in the NAVI table will be representative of the 
size for all frequency band data and byte alignments and will indicate whether the checksum is present or not. 

8.3.4 Error Resilience 

When an error condition has occurred, the decoder behaviour shall be according to Table 8-10 for the fault conditions 
listed. 



Table 8-10. Error Handling 



Fault 


Solution 


Header is corrupt 


Advance to the next frame by returning to sync detection 


Any channel set header is corrupt 


Advance to the next frame by returning to sync detection 


NAVI is corrupt 


Advance to the next frame by returning to sync detection 


Invalid entry in any header field 


Advance to the next frame by returning to sync detection 


1 st frequency band data is corrupt 


Advance to next segment if possible. Otherwise, advance to the 
next frame 


2nd frequency band data is corrupt 


Render the 1st frequency band and zero all subsequent 
frequency band data 


3rd frequency band data is corrupt 


Render the 1st and 2nd frequency bands and zero all 
subsequent frequency band data 


4th frequency band data is corrupt 


Render the 1 st , 2 na and 3 ra frequency bands and zero the 4th 
frequency band data 


Any other fault condition 


Advance to the next frame by returning to sync detection 



8.4 Lossless Stream Decoding 

The data stream input to the decoder consists of both lossy (core) and lossless content. If the stream contains both lossy 
and lossless components, a core decoder will decode only the lossy part, whereas a lossless decoder will reconstruct the 
residual signal and combine it with the lossy part before output. If the stream does not contain a backward-compatible 
lossy stream, then only the lossless decoder will be active. 

There can be additional frequency band data that contain information for increasing the sample rate. The total number 
of frequency bands is deduced from the field bXtraFreqBands, which is only present in the channel set sub-header when 
sFreqlndex is greater than 96 kHz. When this field is set, it will indicate that the total number of frequency bands 
(m_nNumFreqBands) is: 

f 2 for 96 kHz < m_nFs < 192 kHz 
m nNumFrecBands = ^ 

|4 for 192 kHz<m_nFs<384 kHz 

Otherwise, if clear, it indicates that the total number of frequency bands is: 

fl for 96 kHz < m_nFs < 192 kHz 
m nNumFrecBands = ^ 

|2 for 192 kHz<m_nFs<384 kHz 

The following diagram illustrates how to unpack frequency band data. 

A decoder capable of decoding the lossless extension (Figure 8-5) will perform the following tasks: 

• Unpack frequency band data - stream parsing. 

• Inverse fixed prediction - linear signal reconstruction. 

• Inverse adaptive prediction - adaptive signal reconstruction. 

• Inverse pairwise channel de-correlation - scaling and reordering of channels. 

• Reconstruct PCM samples - summation of core and lossless samples. 
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Figure 8-5: DTS Lossless Decoder 



8.4.1 Band Data 

By default, the first band in a channel set contains the encoded data for sampling frequencies less than or equal to 
96 kHz. 

There are two types of data packing, as indicated by ncSegtype: 

• Type 0, for which the coding parameters (bit allocation, etc.) are unique for each channel in the frequency 
band. The block of Rice code flags for each channel in the channel set comes first. The block of corresponding 
coding parameters for each channel in the channel set comes next. This is followed by the block of entropy 
codes for all residual samples in the segment for all channels of the channel set. 

• Type 1, for which all channels in the frequency band use a common set of coding parameters. The Type 1 
segment is unpacked in a similar way to Type 0, except that for all channels in the channel set, a common Rice 
code flag and the common ABIT value(s) appear in the stream before the block of entropy codes. 

Regardless of the data packing type, the data in segment is further sub-divided into two parts: 

• Parti -Start of PCM. 



Part 2 - ADPCM. 
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To ensure that frames can be edited independently, ADPCM history is not carried between the frames. Instead, at the 
beginning of each frame (in the Part 1 section of segment 0), adaptive prediction is disabled and re-enabled only after a 
sufficient number of PCM samples are available for initialization of the predictor's delay line. The prediction residuals 
obtained after enabling adaptive prediction constitute the Part 2 section. Only segment is permitted to have both Part 1 
and Part 2 sections. All other segments shall be entirely comprised of Part 2 data. The number of samples in Part 1 is 
dependent on the order of the adaptive predictor that is used in each particular channel and the segment type 
(ncSegtype). In particular, the number of samples in Part 1 of segment is determined as follows: 

• ncSegtype = 0; pnAdaptPredOrder[nCh] for nCh=0,. . . nChSetLLChannel; 

• ncSegtype = 1 ; m_nCurrHighestLPCOrder for all channels in the current channel set; 

where nCurrHighestLPCOrder denotes the highest ADPCM order among all the channels of the current channel set. 

8.4.1 .1 Unpacking Frequency Band Data 

The data in all bands is packed according to the same stream syntax. 

The following flowcharts and code detail the syntax of the band data whose composition dynamically varies as the data 
is read. 

The band data can consist of four formats: 

• Linear-encoded MSBs. 

• RICE-encoded MSBs. 

• "Hybrid" RICE-encoded MSBs. 

• Linear-encoded LSBs. 

The following pseudo-code illustrates this procedure: 

// Start unpacking MSB portion of segment 
if { nSeg != ) 

bUseSegStateCodeParm = (ExtractBits (1) == 1 ) ? true : false; 
// Unpack segment type: 

// - --> implies use of distinct coding parameter for each channel 
// - 1 --> implies use of common coding parameter for all channel 
if { ! bUseSegStateCodeParm ) 

m_ncSegType = ExtractBits ( 1 ) ; 
// Determine num of coding parameter sets encoded in segment 
// For segment type == 0, distinct coding parameter for each channel 
// For segment type == 1, same coding parameter for all channel 
nNumParmSets = (m_ncSegType ==0) ? m_nChSetLLChannel : 1; 

if ( ! bUseSegStateCodeParm ) 
{ 

// Allocate resources to store coding parameters 
ReallocCodingParm (nNumParmSets ) ; 

// Unpack Rice coding flag false- >linear code; true-> Rice code 
for { nParmlndex = 0; nParmlndex < nNumParmSets; nParmIndex++ ) { 

m_pabRiceCodeFlag [nParmlndex] = (ExtractBits ( 1 ) == 1) ? true : false; 
if (m_ncSegType==0 && m_pabRiceCodeFlag [nParmlndex] ==true) { 

// Unpack Hybrid Rice coding flag 0->Rice code; l-> Hybrid Rice code 
if (ExtractBits (1) == 1) 

// Unpack binary code length for isolated max samples in Hybrid Rice coding 
m_pancAuxABIT [nParmlndex] = ExtractBits (m_nBits4ABIT) +1 ; 

else 

m_pancAuxABIT [nParmlndex] = 0; // indicates no Hybrid Rice coding 

} 

} 

// Unpack coding parameter 

for ( nParmlndex = 0; nParmlndex < nNumParmSets; nParmIndex++ ) 
{ 

if (nSeg==0) 
{ 

// Unpack coding parameter for part 1 of segment 
m_pancABIT0 [nParmlndex] = ExtractBits (m_nBits4ABIT) ; 

if (m_pabRiceCodeFlag [nParmlndex] ==false) // Adjustement for the linear code 
m_pancABIT0 [nParmlndex] = (m_pancABIT0 [nParmlndex] ==0 ) ? : m_pancABIT0 [nParmlndex] +1 ; 
if ( m_ncSegType == ) 
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m_panSamplPartO [nParmlndex] = m_pnAdaptPredOrder [nParmlndex] ; 

else 

m_panSamplPartO [nParmlndex] = m_nCurrHighestLPCOrder ; 

} 

else 

m_panSamplPartO [nParmlndex] = ; 

// Unpack coding parameter for part 2 of segment 
m_pancABIT [nParmlndex] = ExtractBits (m_nBits4ABIT) ; 

if (m_pabRiceCodeFlag [nParmlndex] ==false ) // Adjust for the linear code 

m_pancABIT [nParmlndex] = (m_pancABIT [nParmlndex] ==0 ) ? : m_pancABIT [nParmlndex] +1 ; 

} 

} // ! bUseSegStateCodeParm 

NOTE: For linear encoding, any non-zero ABIT code is incremented by one. 

After the RICE coding flags and the corresponding coding parameters (ABIT) have been unpacked, the remainder of 
the data is unpacked. This is described in clauses 8.4.1.2 and 8.4.1.4 of the present document. 

After unpacking and byte alignment, the position of the bit pointer in bytes shall equal the data section Fsize-1. 



8.4.1 .2 Entropy Codes Unpacking and Decoding 

For each channel, a RICE code flag, a sample bit allocation and residual entropy codes are extracted from the stream. 
The entropy codes are either binary or RICE codes. Furthermore RICE codes may be either 'Straight' RICE codes 
(referred to as RICE codes) or 'Hybrid' RICE codes. 

The RICE code consists of a variable length unary part (all '0') followed by the stop '1' bit and followed by a binary part 
of length m_pancABIT[nParmIndex]. 

The 'Hybrid' RICE coding is employed to guarantee an upper bound (nBits4SamplLoci + 

m_pancAuxABIT[nParmIndex]) on the length of the unary part of RICE codes. In case of 'Hybrid' RICE codes, some 
samples are isolated and coded as linear codes of length m_pancAuxABIT[nParmIndex], while the remaining samples 
are coded as RICE codes. The number and the location index of isolated samples are transmitted in the stream 
immediately before the residual entropy codes. The residual buffer is initialized to '1' at all locations of isolated samples 
and to '0' at all remaining locations. This initial state of residual buffer ('0' or '1') is used to indicate the type of 
extraction that needs to be performed for each residual sample. The two types of extraction are: 

• T implies extraction of linear code of length m_pancAuxABIT[nParmIndex]; and 

• '0' implies extraction of RICE code with coding parameter equal to m_pancABIT[nParmIndex]). 

The 'Hybrid' RICE codes can only exist in segments with m_ncSegType = 0. More details about the RICE codes may be 
found in clause C.7. 

The corresponding code is: 

// Unpack Entropy codes 

for (nCh=0; nCh<m_nChSetLLChannel ; nCh++) { 

pResBuffer = pnlnputBuf f er [nCh] + m_nSamplFrameIndex; 

// For segment type == 0, distinct coding parameter for each channel 
// For segment type == 1, same coding parameter for all channel 
nParmlndex = (m_ncSegType==0 ) ? nCh : ; 

if (m_pabRiceCodeFlag [nParmlndex] == false) { 

// ========================== Linear Code ================================= 

if (m_pancABIT0 [nParmlndex] >0) { 

1 1 = = = = = = = = = = = Unpack all residuals in one channel of part one of segment 

for (n=0; n<m_panSamplPartO [nParmlndex] ; n++) { 

nTmp = ExtractBits (m_pancABIT0 [nParmlndex] ) ; // Unpack as unsigned 
// Map to signed 

*pResBuf f er++ = (nTmp & 0x1) ? - (nTmp>>l) -1 : nTmp>>l; 

} 

} 

else{ 

for (n=0; n<m_panSamplPartO [nParmlndex] ; n++) 
*pResBuf f er++ = 0; 

} 

if (m_pancABIT [nParmlndex] >0) { 
// Unpack all residuals in one channel of part 2 of segment and all other segments 
for (n=0; n<m_nSmplInSeg-m_panSamplPartO [nParmlndex] ; n++) { 
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nTmp = ExtractBits (m_pancABIT [nParmlndex] ) ; // Unpack as unsigned 
// Map to signed 

pResBuf f er [n] = (nTmp & 0x1) ? - (nTmp>>l) -1 : nTmp>>l; 

} 

} 

else{ 

for (n=0; n<m_nSmplInSeg-m_panSamplPartO [nParmlndex] ; n++) 
pResBuf f er [n] = 0; 

} 

} 

else{ // ======================Rice code ================================== 

// ============ Unpack all residuals in one channel of part 1 of segment 

if (m_pancABIT0 [nParmlndex] >0) { 

for (n=0; n<m_panSamplPartO [nParmlndex] ; n++) { 

ExtractUnary (nuTmp) ; // Extract the unary part 

// Extract the binary part and assemble the unsigned word 

nuTmp = (nuTmp<<m_pancABIT0 [nParmlndex] ) | ExtractBits (m_pancABIT0 [nParmlndex] ) ; 
// Map to signed 

*pResBuf f er++ = (nuTmp & 0x1) ? - ( (int) (nuTmp>>l) ) -1 : nuTmp>>l; 

} 

} 

else{ 

// For k=0 there is only unary code plus the stop bit 
for (n=0; n<m_panSamplPartO [nParmlndex] ; n++) { 

ExtractUnary (nuTmp) ; // Extract the unary part 

// Assemble the Signed word 

// Map to signed 

*pResBuf f er++ = (nuTmp & 0x1) ? -((int) (nuTmp>>l) ) -1 : nuTmp>>l; 

} 

} // end ncABITO condition 
// Set all locations to 

memset (pResBuffer, 0, sizeof (int) *m_nSmplInSeg-m_panSamplPartO [nParmlndex] ) ; 
// Unpack the number of isolated max samples when Hybrid Rice coding 
if (m_pancAuxABIT [nParmlndex] >0) { 

nCountlsMax = ExtractBits (nBits4SamplLoci ) ; 
for (n=0; n<nCountIsMax; n++) { 

// Extract the location of isolated max samples and 
// flag the location by 1 

pResBuffer [ExtractBits (nBits4SamplLoci) ] = 1; 

} 

} 

// Unpack all residuals in one channel of part 2 of segment and all other segments 
if (m_pancABIT [nParmlndex] >0) { 

for (n=0; n<m_nSmplInSeg-m_panSamplPartO [nParmlndex] ; n+ + ) { 
if (pResBuffer [n] ==0) { 
// Pure Rice 

ExtractUnary (nuTmp) ; // Extract the unary part 

// Extract the binary part and assemble the unsigned word 

nuTmp = (nuTmp<<m_pancABIT [nParmlndex] ) | ExtractBits (m_pancABIT [nParmlndex] ) ; 

} 

else 

// Isolated max binary coded; 

nuTmp = ExtractBits (m_pancAuxABIT [nParmlndex] ) ; 
// Map to signed 

pResBuf f er [n] = (nuTmp & 0x1) ? -((int) (nuTmp>>l) ) -1 : nuTmp>>l; 

} 

} 

else{ 

// Unpack all residuals in one channel of current segment 
// For k=0 there is only unary code plus the stop bit 
for (n=0; n<m_nSmplInSeg-m_panSamplPartO [nParmlndex] ; n++) { 
if (pResBuffer [n] ==0) { 
// Pure Rice 

ExtractUnary (nuTmp) ; // Extract the unary part 

} 

else 

// Isolated max binary coded; 

nuTmp = ExtractBits (m_pancAuxABIT [nParmlndex] ) ; 
// Map to signed 

pResBuf f er [n] = (nuTmp & 0x1) ? -((int) (nuTmp>>l) ) -1 : nuTmp>>l; 

} 

} // end ncABIT condition 
} // end bRiceCodeFlag condition 

} 
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If the Rice code flag is false, indicating the use of linear codes and the ABIT allocation is zero, then the residual 
samples are all zero and the decoder should fill the unpacked array with zeroes. In the case of linear codes that have a 
non-zero ABIT(s) allocation, the ABIT value is incremented by 1 to accommodate the sign bit of residual that is located 
at the least significant bit and which is removed prior to sign conversion. The linear-encoded residuals should be 
extracted using the adjusted bit allocation and converted to a signed value using: 

*pResBuf f er++= (nTmp) & 01) ? - (nTmp>>l) - 1 : (nTmp>>l) 

Each RICE-encoded residual contains a unary part and a binary part that are extracted separately before combining. For 
zero bit allocation, the code consists of only the unary part and is extracted by finding the number of zeros preceding 
the stop bit (1). 

The remainder of the RICE code, the binary part, is extracted using ABIT and is converted to a signed value in the same 
way as the linearly coded residuals. Both unary and binary parts are then combined by shifting the unary part left by 
(ABIT) places before performing a bit wise logical 'OR' with the binary part. 

8.4.1 .3 Decimator History Unpacking 

Within the MSB data for the first segment of frequency bands 1 and 3, a set of decimator frequency band history 
coefficients shall be extracted after unpacking the entropy codes. 

// Unpack decimator history 

if (nSeg == && (nFreqBand==l | | nFreqBand==3 ) ) 
{ 

int nNumBitsForHistSampl ; 

nNumBitsForHistSampl = ExtractBits ( 5 ) +1 ; // Unpack m_unNumBitsFBTxHistSamples 

for <nCh=0; nCh<m_nChSetLLChannel ; nCh++) 
{ 

for (n=0 ; n<7 ; n++ ) 

pnDeciHistoryFreqBand [nCh] [n] = ExtractBits (nNumBitsForHistSampl ) ; 

} 

} 

The value of nNumBitsForHistSampl will always be 32 bits. 

8.4.1 .4 LSB Residual Unpacking 

If FlagScalableLSBs is set, the LSB part of residuals shall also be extracted. The beginning of the LSB data can be 
found at: 

LSB_Start = Start of current channel set in current frequency band + BandChSetSize[Band][Seg][ChSET] - 
nLSBFsize[nBand]. 

If channel sets CRCs are embedded, then LSBJStart is reduced by the width of the checksum field. 

The parameter nLSBFsize[nBand] is the same size for all segments of frequency band nBand and it is obtained from the 
transmitted parameter in a channel set's sub header. 

The LSB part is linearly coded and should be extracted using the corresponding bit allocation 

(pnScalableLSB [nBand] [ch]) for all segments of frequency band nBand in the current frame. The remaining LSB 

residuals are simply unpacked as linear codes. 

8.4.2 Fixed Coefficient Prediction 

The inverse fixed coefficient prediction process, on the decode side, is defined by an order recursive formula for the 
calculation of k th order residual at sampling instance n: 

e k [n] = e k+l [n] + e k [n-\] 

where the desired original signal s[n] is given by: 

s[n] = e Q [n] 

and where for each k th order residual e k [-l] = 0. 
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In the following example, recursions for the 3 rd order fixed coefficient prediction, where the residuals e 3 [n] are coded, 
are transmitted in the stream and unpacked on the decode side: 

e 2 [n] = e 3 [n]+e 2 [n-l] 
e 1 [n] = e 2 [n]+e 1 [n-l] 
e Q [n] = e l [n]+e Q [n-l] 
s[n] = e [n] 

The following code demonstrates inverse fixed prediction in the frequency band nBand. 

for (nCh=0; nCh<m_nNumCh; nCh++) { 

// for m_pnFixedPredOrder [nBand] [nCh] =0 e0 = input sample 
if (m_pnFixedPredOrder [nBand] [nCh]>0){ 
// Zero out channel working buffer 

memset ( SprnWorkResBuf f er [0] , 0, SIZE_WORK_RES_BUFFER*sizeof (int) ) ; 
pnlnTemp = pnlnputBuf f er [nCh] ; 
for (n=0; n<m_nFrmSize ; n++) { 

pnBuffTemp = pnWorkResBuf f er ; 

*pnBuffTemp = pnlnTemp [n] ; // load residual 

for (nOrd=0; nOrd< m_pnFixedPredOrder [nBand] [nCh] ; nOrd++) { 
// Calculate the ek [n] = ek+1 [n] + ek[n-l] 
pnBuffTemp [2] = pnBuffTemp [0] + pnBuf f Temp [1] ; 
// Save the current residuals for the next sample iteration 
pnBuffTemp [1] = pnBuf f Temp [2] ; 
pnBuffTemp += 2; 

} 

// Save the regenerated sample 
pnlnTemp [n] = *pnBuf f Temp ; 
} // end sample loop in the frame 
} // end if (m_pnFixedPredOrder [nBand] [nCh]>0) 
} // end channel loop 



8.4.3 Inverse Adaptive Prediction on the Decode Side 

The block diagram in Figure 8-6 depicts the inverse adaptive prediction process on the decoder side. 

The first step in performing inverse adaptive prediction is to extract adaptive prediction orders PrOr [Ch] for each 
channel Ch=l , ... NumCh. Next, for the channels with PrOr [Ch] >0, the unsigned version of linear area ratios 
(LAR) quantization indices (PackLARInd [n] for n=l, ... PrOr [Ch] ) are extracted. For each channel Ch with 
prediction order PrOr [Ch] >0, the unsigned PackLARInd [n] are mapped to the signed values QLARInd [n] using 
the following mapping: 



QLARInd[n] ■■ 



[ PackLARInd[n] » 1 V even numbered PackLARInd[n] 
[— (PackLARInd[n] » 1) — 1 V odd numbered PackLARInd[n] 
for n = l,...,PrOr[Ch] 



where the » denotes an integer right shift operation. 
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Figure 8-6: Inverse Adaptive Prediction 

In the 'Table Look-up of quantized RC block, an inverse quantization of LAR parameters and a translation to reflection 
coefficient (RC) parameters is done in a single step using a look-up table TABLE { } defined as: 

TABLE ={0, 3070, 5110, 7140, 9156,11154,13132,15085,17010, 

18904, 20764, 22588, 24373, 26117, 27818, 29474, 31085, 32648, 
34164, 35631, 37049, 38418, 39738, 41008, 42230, 43404, 44530, 
45609, 46642, 47630, 48575, 49477, 50337, 51157, 51937, 52681, 
53387, 54059, 54697, 55302, 55876, 56421 , 56937, 57426, 57888, 
58326, 58741, 59132, 59502, 59852, 60182, 60494, 60789, 61066, 
61328, 61576, 61809, 62029, 62236, 62431, 62615, 62788, 62951, 
63105, 63250, 63386, 63514, 63635, 63749, 63855, 63956, 64051, 
64140, 64224, 64302, 64376, 64446, 64512, 64573, 64631, 64686, 
64737, 64785, 64830, 64873, 64913, 64950, 64986, 65019, 65050, 
65079, 65107, 65133, 65157, 65180, 65202, 65222, 65241, 65259, 
65275, 65291 , 65306, 65320, 65333, 65345, 65357, 65368, 65378, 
65387, 65396, 65405, 65413, 65420, 65427, 65434, 65440, 65446, 
65451 , 65456, 65461 , 65466, 65470, 65474, 65478, 65481 , 65485, 



65488, 65491}. 
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The quantized reflection coefficients for each channel Ch (QRC [n] for n= 1, ... PrOr [Ch] ) are calculated 
from the TABLE { } and the quantization LAR indices QLARInd [n] , as: 



QRC[n] = ■ 



jTABLE[QLARInd[n]] \/QLARInd[n] > 
{- TABLE[-QLARInd[n]] VQLARInd[n] < 
forn = l,...,PrOr[Ch] 



In the next block, for each channel Ch, the quantized RC parameters QRC ord for ord = 1, ... PrOrfCh] are translated 
to the quantized linear prediction parameters (LP ord for ord = 1, ... PrOr[Ch]) according to the following algorithm: 

Forord = 0toPrOr-ldo 
For m = 1 to ord do 

C ord+l,m = C ord,m +(Q RC ord+l X C ord , ord+l-m + C 1 « l5 )) >>l6 

end 

C rd+l,ord+l = Q R C rd+l 

end 

Forord = 0toPrOr-ldo 

LPord + 1 = C PrOr, ord + 1 
end 

Any possibility of saturation of intermediate results is removed on the encode side. Therefore on the decode side, there 
is no need to perform saturation check after calculation of each C ord +1 m . 

Finally, for each channel with PrOr[Ch]>0, an inverse adaptive linear prediction is performed. Assuming that prediction 
residuals e(n) are previously extracted and entropy decoded, the reconstructed original signals s(n) are calculated 
according to the following equations: 



s(n)- 



fPr Or[Ch] 

\ LP k Xs{n-k)\ + (l «15) 



lc=l 



»16 



Limit s{n) to 24 — bit range (— 2 23 to 2 23 — lj 
e{ri) = s{n) — s{n) 

for n = PrOr[Ch] + l, ... NumSamplInFrame 

Since the sample history is not persistent between the frames, the inverse adaptive prediction shall start from the 
(PrOr[Ch]+l) sample in the frame. 

The first of two related function calls show the conversion of reflection coefficients. 

// Read the quantized reflection coefficients from the table 
for (nOrd = 0; nOrd < nLPCOrder; nOrd++) { 
nTmp = pRef ICof Qind [nOrd] ; 

nRcq [nOrd] = (nTmp>=0) ? m_pnTanhTbl [nTmp] : -m_pnTanhTbl [ -nTmp] ; 

} 

// Conversion from reflection coefficients to direct form coefficients 
pnRcq = nRcq; 

for (nOrd = 1; nOrd <= nLPCOrder; nOrd++) { 
pn_F_Coef = naCoeffs; 
pn_B_Coef = naCoeffs + nOrd - 2; 
for (n = 1; n <= (nOrd>>l) ; n++) 
{ 

nTmp = *pn_F_Coef; 
nTmpl = *pn_B_Coef; 

// No need to check for saturation; prevented in the encoder 

*pn_F_Coef++ = nTmp + (int) ( ( ( (DTS int64) (*pnRcq) * nTmpl) + nRound) >> Q_LL_PREDCOEFFS ) ; 

*pn_B_Coef-- = nTmpl + (int) ( ( ( (DTS int64) (*pnRcq) * nTmp) + nRound) >> Q_LL_PREDCOEFFS ) ; 

} 

naCoef f s [nOrd-1] = *pnRcq++; 
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// Reverse the order of coefficients 
pn_B_Coef = naCoeffs + nLPCOrder -1; 
for (nOrd=0; nOrd<nLPCOrder ; nOrd++) 

pnLPCCoef [nOrd] = *pn_B_Coef - - ; 
Inverse adaptive prediction is covered below. 
pReflCofQind = m_pnLPCRef ICoef f sQInd; 
for (nCh=0; nCh<m_nNumCh; nCh++) { 

nPredOrd=m_pnAdaptPredOrder [nCh] ; 

if (nPredOrd>0) { 

// Get the prediction coefficients 

if { !RCInd2Coef f s (nPredOrd, pRef ICof Qind, pnAdCoeffs) ) 
return false; 

// Second part of the frame where the residuals need to be 
// replaced with actual samples 
nStartResOut = nPredOrd; 

pnlnTemp = pnlnputBuf fer [nCh] +nStartResOut ; 
pnFiltBuffer = pnlnTemp-nPredOrd; 
for (n=nStartResOut ; n<m_nFrmSize ; n++) { 
ndErr = ; 

for (nCoef lnd=0 ; nCoef Ind<nPredOrd; nCoefInd++) { 

ndErr += ( (DTS int64) pnFiltBuf f er [nCoef Ind] ) *pnAdCoef f s [nCoef Ind] ; 

} 

// Round and scale the prediction; Coefficients are in Q_LL_PREDCOEFFS 
nxhat = (int) ((ndErr + (1<< (Q_LL_PREDC0EFFS-1) ) ) >>Q_LL_PREDCOEFFS) ; 
if ( nxhat >nHigh) 

nxhat = nHigh; 
else if (nxhat<nLow) 

nxhat = nLow; 

// Calculate the original sample x(n) = err - xhat (n) 
// The saturation check is not needed; it has been taken 
// care on the encode side 

nRes = *pnInTemp; 
*pnInTemp++ = nRes - nxhat ; 

pnFiltBuf fer++ ; 
} // End sample loop in the frame 
} // end of if (nPredOrd>0) 
pReflCofQind += nPredOrd; 
} // end channel loop 

8.4.4 Inverse Pairwise Channel Decorrelation 

When a channel pair has a non-zero pairwise channel decorrelation coefficient, pairwise channel de-correlation 
(PWChD) shall be applied to the channel pair. The current channel is the basis (BCh) and the next channel is the 
decorrelated channel (DecCh). To recover the original channel data (CorCh), each of the samples/residuals in the basis 
channel are scaled by the pairwise channel decorrelation coefficient (a) and then added to the corresponding 
samples/residuals of the decorrelated channel. 

CorCh = DecCh+ aBCh 

The original channel order should also be restored before the lossless residuals are combined with the lossy output. (The 
channel order may have been changed at the encoder to make best use of Pairwise Channel Decorrelation). 

Individual channels in a channel set are numbered beginning from 0. The matrix field OrigChanOrder indicates the 
reordering. The following sample code illustrates the decoded buffers being copied and scaled and then reordered. 

// Inverse channel decorrelation 

for (nCh=0; nCh< (m_nNumCh>>l) ; nCh++) { 

ndCoeff = (DTS int64) pnJRChPairsCoef f s [nCh] ; 

if (ndCoef f ! = 0) { 

pnB = pnlnputBuf fer [nCh<<l] ; 
pnj = pnlnputBuf fer [ (nCh<<l) +1] ; 
for (n=0; n<m_nFrmSize ; n++) 

*pnJ++ += (int) ( ( ndCoef f * (*pnB++) + nRound) >>nQ ) ; 




// Reorder channel pointers to the original order 
for (nCh=0; nCh<m_nNumCh; nCh++) 

ppSavelnBuf f Pntrs [nCh] = pnlnputBuf fer [nCh] ; 
for (nCh=0; nCh<m_nNumCh; nCh++) 

pnlnputBuf fer [pncOriginalChOrder [nCh] ] = ppSavelnBuff Pntrs [nCh] ; 
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8.5 Lossless Processes 

8.5.1 Assembling the MSB and LSB Parts 

Since the residuals in a frequency band may have been encoded using an MSB/LSB split, the MSBs of the residuals will 
have been scaled as part of the lossless encoding algorithm and the LSBs will appear as binary codes width length equal 
to NumScalableLSBs[nCh]. The use of this MSB/LSB split is indicated per frequency band and per channel on a 
frame -by-frame basis in the bScalableLSB (for frequency band 0) and bFlagScalableResExtBand[] (for each frequency 
extension band) fields of the header. 

When a MSB/LSB split is enabled in a frequency band each channel set of each segment consists of two components 
namely the MSB and the LSB part, as shown in Figure 8-7. For lossless operation both the MSB and LSB components 
are required. 




Figure 8-7: Layout of MSB and LSB Data Within One Frequency Band 

The MSB component alone represents the most significant portion of the output. The decoder needs to know the width 
(in bits) of the output and the width of the MSB part in order to properly scale the MSB component before inserting it 
into the output. 

The MSB/LSB split on the encode side may employ either variable or fixed-split point method. The fixed split point 
method has fixed lengths of the MSB and LSB parts for the entire stream duration and for all channels. The split point is 
determined by a parameter nuFixedLSBWidth>0 that represents the length of the LSB part prior to a possible bit width 
adjustment performed at the authoring stage. Notice that nuFixedLSBWidth > implies the LSB width > 0, which 
consequently allows for bit- width adjustment in every frame. Although the original MSB/LSB split point is fixed for all 
channels and all frames, the bit-width adjustment may be variable over the channels and the frames as carried in 
nBitWidthReduction[nCh]. After performing a variable bit-width adjustment to the stream that has been originally 
created using the fixed split method (nuFixedLSBWidth>0) the resulting stream will have variable length LSB parts and 
consequently nuFixedLSBWidth will be adjusted to be equal to 0. 

In contrast, the variable-split point method allows the lengths of the MSB and LSB parts to vary from frame to frame 
and between the channels. In this case, where parameter nuFixedLSBWidth=0, the split point in each channel nCh, is 
determined by the NumScalableLSBs[nCh]. This number represents the original width of the LSB part as used during 
the encode process and prior to a possible bit width adjustment at the authoring stage. Notice, in this case, some frames 
may have NumScalableLSBs[nCh]=0 and consequently the bit-width adjustment is not possible. This situation may 
occur for the low levels of input audio signal. However, at the low levels of input audio signal, the variable output bit 
rate is already low and usually there is no need for bit-width adjustment at the authoring stage. 

Both methods allow for dynamic bit-width adjustment. For example, the peak bit rate of some encoded material may 
exceed the allowed peak bit rate of the medium at very few instances and the peak bit-rate need to be reduced only at 
those instances; therefore, sacrificing the lossless reconstruction at very few moments of the entire presentation. Unlike 
the variable split method, the fixed-split point method allows for uniform bit-width adjustment over the entire stream 
duration and in all channels. As a result, it can guarantee non-varying noise floor in all channels, producing the output 
audio equivalent to the audio obtained from uniformly reducing the bit-width to the desired number of bits (after 
applying appropriate dither). The variable-split point method usually gives a better compression performance then the 
fixed-split point method. 

In both, the fixed-split and the variable-split point methods, the NumScalableLSBs[nCh] denotes the length of binary 
codes that correspond to the samples of the LSB part in channel nCh. 
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Beginning with an array of assembled MSB sample data and the LSB sample data, the reconstruction of the lossless 
output (with bit-width equal to nBitResolution) of particular channel nCh, in case when no bit-width adjustment has 
been performed during the authoring stage, is shown in Figure 8-8. The NumScalableLSBs[nCh] reflects the number of 
bits that are used to binary code the LSB samples in channel nCh. 





nShift ► 


MSBs 


Zeros 





nShift ► 


Zeros 


LSBs 



-nBitResolution- 



MSBs and LSBs Combined 



nBitWidthReduction[nCh] = 

nShift = (nuFixedfLSBWidth==0) ? NumScalableLSBs[nCh]: nuFixedfLSBWidth 



Figure 8-8: Output Word Assembly for Channels Without Bit-width Adjustment 

(nBitWidthReduction[nCh]=0) 

The bit-width management process performed at the authoring stage may reduce the resolution of the samples 
corresponding to the LSB part of channel nCh by a reduction factor carried in a stream as nBitWidthReduction[nCh]. 
During this bit-width adjustment process a dither signal is added to the LSB samples. As a consequence the remaining 
LSB samples after removal of the nBitWidthReduction[nCh] lower significant bits require NumScalableLSBs[nCh] - 
nBitWidthReduction[nCh] + 1 bits for their transmission. An additional bit corresponding to the + 1 term in the above 
expression, is a result of adding a dither signal. 

Notice that after the bit-width adjustment process, the NumScalableLSBs[nCh], that are transmitted in the stream, are 
altered such that they still represent the binary code length that is used to code the adjusted samples of the LSB part, 
i.e.: 

NumScalableLSBs[nCh] = NumScalableLSBs[nCh] - nBitWidthReduction[nCh] +1 

The reconstruction of the lossless output of particular channel nCh, in case when a bit-width adjustment has been 
performed during the authoring stage, is shown in Figure 8-9. The NumScalableLSBs[nCh] reflects the number of bits 
that are used to binary code the LSB samples after the bit-width reduction process (denoted as "Remaining LSBs"). The 
nBitWidthReduction[nCh], indicates the number of lower significant bits that were removed from the LSB samples 
during the bit-width adjustment process. 
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Zeros 



-nBitResolution- 



MSBs and Remaining LSBs Combined 



-nAdj- 
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nAdj = nBitWidthReduction[nCh] > 
nShift= (nuFixedfLSBWidth==0) ? NumScalableLSBs[nCh -1 + nAdj : nuFixedfLSBWidth 

Figure 8-9: Output Word Assembly for Channels with Bit-width Adjustment 
(nBitWidthReduction[nCh]>0) 

The following pseudo-code illustrates assembling MSB and LSB parts: 

for (nCh=0; nCh<nNumCh; nCh++) 
{ 

nShiftAdj=m_pnBitWidthAdjPerCh [nBandlndex] [nCh] ; 

if (nuFixedLSBWidth==0) { 
nShift = {nShif tAdj >0 && m_pnScalableLSBs [nBandlndex] [nCh]>0) ? 
m_pnScalableLSBs [nBandlndex] [nCh] -1 : m_pnScalableLSBs [nBandlndex] [nCh] ; 
nShift += nShif tAdj ; 

} 

else 



nShift 



nuFixedLSBWidth ; 



if <nShift>0) { 

pnlnSamples = pnlnputBuf f er [nCh] ; 
punLSBPart = pnlnResAHChLSBs [nCh] ; 
for (n=0; n<m_nFrmSize ; n++) { 

nTmp = ( *pnInSamples ) <<nShif t ; 

*pnInSamples++ = ( nTmp + ( (*punLSBPart++) <<nShif tAdj ) ); 

} 

} 



To obtain the output words, the resulting samples still need to be shifted left by NumEmptyLSBs places. 



8.5.2 Channel Sets Post-Processing 

In order to achieve scalability of the stream and the decoding process, all channels presented to the encoder are 
organized into channel sets. There can be up to 16 channel sets, each with up to 16 channels. Each channel set can be 
separately extracted from the stream and decoded as desired. The main factors that determine the formation of the 
channel sets are the following: 

1) An intended speaker layout for a channel set, i.e. a 7.1 mix, can be encoded in these 3 channel sets: 

a) Decoding of a channel set produces a stereo downmix. 
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b) Decoding and combining of the channel sets and 1 produces a 5.1 downmix. 

c) Decoding and combining of the channel sets 0, 1 and 2 produces the original 7.1 mix. 

2) All channels in a channel set always have the same sampling frequency. In a 5.1 stream where the C, L and R 
channels are at 96 kHz and the Ls, Rs and LFE channels are at 48 kHz, the channels may be organized in two 
channel sets, where channel set consists of all channels sampled at 96 kHz and channel set 1 consists of all 
channels sampled at 48 kHz. 

3) All channels in a channel set always have the same bit-width. For example, for a 5. 1 stream where the L and R 
channels are 24-bit and the C, Ls, Rs and LFE channels are 16-bit, the channels may be organized in two 
channel sets, where channel set consists of all channels with 24-bit samples and channel set 1 consists of all 
channels with 16-bit samples. 

4) An intended version of a sound object for a channel set where one version of a sound object can be replaced by 
another version of the same sound object. For example, there may be a set of channels supporting one version 
of some sound object (Vl_Soundl) and another set of channels supporting a different version of the same 
sound object (V2_Soundl). The Vl_Soundl and V2_Soundl channels sets are replacements for each other 
and they belong to the same replacement group. Only one of the Vl_Soundl or V2_Soundl channel sets may 
be designated as the active set. In addition to designating both the Vl_Soundl and V2_Soundl channel sets as 
replacement sets, the header will also indicate they are both members of the same replacement group. There is 
a maximum of three replacement groups. 

Primary channels represent the subset of all encoded channels. The primary channels may contain the downmixed 
version of other non-primary channels. However the primary channels themselves cannot be further used for 
encoder-embedded downmixing. 

The decoding of primary channels is mandatory. An example is a 7.1 stream with an embedded 5.1 downmix that is 
encoded in two channel sets, where channel set contains a 5.1 downmix and channel set 1 contains two additional 
surround channels. In this case, the channels that correspond to the 5.1 downmix are primary channels and channel set 
is denoted as a primary channel set. 

The primary channels may be split to a maximum of four primary channel sets with differing sampling frequencies 
and/or bit widths. This makes it possible to have a scenario where, for the 5.1 stream, the Centre (C), Left (L) and Right 
(R) channels have a different bit width and/or sampling frequency than the Left Surround (Ls), Right Surround (Rs) and 
Low Frequency Effects (LFE) channels. In this example, the C, L and R channels are part of one primary channel set 
and the Ls, Rs and LFE channels are part of another primary channel set. 

Although in general, the primary channels in different primary channel sets can have different sampling frequencies, 
there are restrictions to this rule, which depend on the existence of the lossy core data. When the lossless stream 
includes the lossy core, all the channels that are coded using both lossy and lossless codecs always have the same 
sampling frequency. Consequently the primary channels that are coded with both lossy and lossless codecs can only be 
split into different primary channel sets based on differing bit-widths. 

8.5.2.1 Performing and Reversing Channel Set Downmixing 

To allow for the scalability of decoder complexity, the audio in the compressed data stream may be organized in 
multiple channel sets with the intermediate downmix formats (i.e. 10.2 -> 7.1 -> 5.1 -> stereo) embedded in a particular 
channel set(s). 

For example for a 10.2 encoded stream with two embedded downmix configurations (5.1 downmix and stereo 
downmix), at least three channel sets would be needed, organized such that: 

• decoding of a channel set produces a stereo downmix; 

• decoding and combining of channel sets and 1 produces a 5.1 downmix; and 

• decoding and combining of channel sets 0, 1 and 2 produces the original 10.2 mix. 

In the example above, the reconstruction of a 5.1 downmix requires the reversal of the 5.1 -> stereo downmix process 
that has been performed on the encode side. Similarly, the reconstruction of a 10.2 original mix requires the reversal of 
first 5.1 -> stereo and then 10.2 -> 5.1 downmix processes that have been performed on the encode side. 

Two types of channel-set downmixing are supported: parallel and hierarchical. 
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8.5.2.2 Parallel Downmix 

The parallel type of downmix is performed for non-primary channel sets which have: 

• a downmix-embedded enabled flag (bDownmixEmbedded=true); 

• a hierarchical-downmix flag disabled (bHierChSet=false), and 

• nReplacementSet = 0. 

The parallel downmix is performed directly to the primary channel set(s), as shown in Figure 8-10. They are applied in 
sequential channel-set index order from the first defined parallel (non-primary) channel set to the last defined parallel 
(non-primary) channel set. 
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Figure 8-10: Parallel Downmix 

On the decode side, the reversal of the parallel downmix is performed in the opposite order - that is, from the last 
defined parallel (non-primary) channel set to the first defined parallel (non-primary) channel set, as shown in 
Figure 8-11. 




Figure 8-11: Parallel Downmix Reversal 
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A 5.1 stream organized into three channel sets with the following characteristics provides an example of parallel 
channel set downmixing and reversal: 

1) channel set carries the stereo downmix with 24-bit resolution at 48 kHz; 

2) channel set 1 carries C channel with 24-bit resolution at 48 kHz; and 

3) channel set 2 carries the Ls, Rs and LFE channels with 16-bit resolution at 48 kHz. 

In this example, channel set is a primary channel set. Both channel set 1 and channel set 2 are non-primary channel 
sets in which bDownmixEmbedded=true, bHierChSet=false and nReplacementSet=0. First channel set 1 is downmixed 
to channel set and then channel set 2 is downmixed to channel set 0. 

On the decode side, the steps are reversed. First channel set 2 is downmixed to channel set 0. Then channel set 1 is 
downmixed to channel set 0. 

8.5.2.3 Hierarchical Downmix 

The hierarchical type of downmix, shown in Figure 8-12, is performed for non-primary channel sets that have these 
three characteristics: 

1) a downmix-embedded enabled flag (bDownmixEmbedded=true); 

2) a hierarchical downmix flag enabled (bHierChSet=true); and 

3) nReplacementSet=0. 




Figure 8-12: Hierarchical Downmix 



The hierarchical downmix is performed in a hierarchical manner, moving from the channel set with the highest index to 
the channel set with the lowest index (primary set(s)). The non-primary channel set may be downmixed in a hierarchical 
manner, to either: 

• the lower-indexed non-primary channel set; or 

• the lower-indexed primary channel set. 

An example is provided by a 10.2 channel stream with two hierarchically embedded downmix configurations 
(10.2 -> 7.1 -> 5.1) organized in three channel sets with the following characteristics: 

• decoding of a channel set produces a 5.1 downmix; 

• decoding and combining channel sets and 1 produces a 7.1 downmix; and 

• decoding and combining channel sets 0, 1 and 2 produces the original 10.2 mix. 

Figure 8-12 illustrates the order in which the downmixing occurs. First the 10.2 stream is downmixed to 7.1 (defined in 
the channel set 2 downmix matrix). Then the 7.1 stream is downmixed to 5.1 (defined in the channel set 1 downmix 
matrix). If the 7.1 to 5.1 downmix is not defined and a 5.1 output is not requested, no downmixing occurs. 
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On the decode side (Figure 8-13), the 7.1 -> 5.1 process is reversed, by subtracting the channel set 1 contribution (by 
means of the channel set 1 downmix matrix) from channel set 0. The resulting modified channel set data is to replace 
the transmitted channel set data. Next the 10.2 to 7.1 process is reversed by subtracting the channel set 2 contribution 
(by means of the channel set 2 downmix matrix) from both channel set 1 and modified channel set 0. 




Figure 8-13: Hierarchical Downmix Reversal 



9 LBR 

This clause addresses the LBR extension. LBR is a low bit rate coding system used in place of the core and extensions 
previously described, but operating in the same ecosystem. The LBR payload is used in conjunction with the extension 
substream header to provide the necessary start-up elements and optional metadata elements required for a complete 
audio elementary stream. 



9.1 The LBR Decoder Environment 

In addition to codec initialization constants carried in, or derived from, information provided in the LBR header, the 
LBR memory map stores a number of other variable session parameters. Header initialized parameters relate to the 
audio sampling frequency and the channel map information, which sets up the frame duration, length of a subframe, 
channel pairing, etc. Other persistent parameters are variables that are usually set to zero at the start of a new decoding 
session, then are updated by the various algorithms from consecutive coding blocks. Some of these constants and 
variables are referred to in the decoding block descriptions, so they are described in the following clause. 



9.1 .1 Persistent Constants and Variables 

The following symbols and variable names are used throughout clause 9 describing the LBR extension. 
nChannels Total number of encoded channels. 



nPair 



Number of channel pairs in the audio stream. nPair is calculated as: 

„ . nChannels+l 
nPair - 



nSampleRate The audio sampling frequency, (see Table 9-3). 

FreqRange A coded parameter derived from nSampleRate (see Table 9-4). 

nFrameDuration The number of rendered audio samples per channel per audio frame (Table 9-11). 



ETSI 



147 



ETSI TS 102 114 V1. 4.1 (2012-09) 



nBitsLeft The number of bits yet to be extracted from the chunk being processed (initialized by the 

chunk length parameter then decremented as bits are parsed). 

Qlevels Quantization levels, this array is configured in the High Resolution chunk. 

SecChPres This is a flag indicating that a secondary channel is present. This flag is set in DecodeTS() if a 

secondary channel can be decoded. 

TSCodingMethod TSCodingMethod is a flag that indicates whether time samples are packed using Huffman 

codes or packed directly. This parameter is configured in DecodeTS(). 

9.2 LBR Extension Substream Header 

The syntax of the LBR audio header is described in Table 9-1. 



Table 9-1 : LBR Extension Substream Header Structure 



Syntax 


Size (Bits) 


// Extract sync word 0x0a801921 
SYNCEXTLBR = ExtractBits (32) 
// Extract LBR header type: 
ucFmtlnf oCode = ExtractBits ( 8 ) 
if (ucFmtlnf oCode == 2) 

{ // LBR decoder initialization data follows: 
nLBRSampleRateCode = ExtractBits ( 8 ) 
usLBRSpkrMask = ExtractBits ( 16 ) 
nLBRversion = ExtractBits ( 16 ) 
nLBRCompressedFlags = ExtractBits ( 8 ) 
nLBRBitRateMSnybbles = ExtractBits ( 8 ) 
nLBROriginalBitRate LSW = ExtractBits ( 16 ) 
nLBRScaledBitRate LSW = ExtractBits ( 16 ) 

} // end if (ucFmtlnf oCode == 2) 

else if (ucFmtlnf oCode != 1) 

{ 

// unknown ucFmtlnf oCode : resync to next SYNCEXTSSH 

} 

// LBR compressed audio data follows 


32 
8 

8 

16 
16 
8 
8 
16 
16 

Extends to next SYNCEXTSSH sync 
word, as determined by 
nuExtSSFsize in Extension 
SubstreamHeader 



SYNCEXTLBR (extension substream sync word for LBR) 

The extension substream has a DWORD-aligned synchronization word dts syncword lbr with the hexadecimal value 
of 0x0a801921. By default the sync word (DWORD) will only occur on a DWORD boundary (4 Bytes). 

ucFmtlnfoCode 

This unsigned 8-bit value declares the LBR header type. Currently two LBR header types are defined 
(LBR_HDRCODE_SYNC_ONLY and LBR_HDRCODE_DECODERINIT). If an undefined header type is 
encountered, the decoder should resync to the next DTS-HD Extension Substream sync word. The first header type that 
a decoder should handle is LBR_HDRCODE_DECODERINIT because this header is necessary to initialize the LBR 
decoder with bitrate, samplerate, channel count and flags. 



Table 9-2: ucFmtlnfoCode values 



LBR header type 


Value 


Description 


LBR_HDRCODE_SYNC_ONLY 


1 


Header consists only of LBR sync word and this 
ucFmtlnfoCode byte; thus raw LBR audio data follows. 


LBRHDRCODEDECODERINIT 


2 


Header consists of LBR sync word, this ucFmtlnfoCode byte 
and LBR decoder initialization data. Raw LBR audio data will 
follow the LBR decoder initialization data. 


Reserved 


All undeclared values are reserved for future use. 
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nLBRSampleRateCode (sample rate of LBR audio samples) 

The sample rate of the LBR decoded audio. This 8 bit value is a lookup into a sample rate table and defines the sample 
frequency of the decoded audio samples. nSampleRate, as shown in Table 9-3. 



Table 9-3: nLBRSampleRateCode Sample Rate Decoding 



n 1 R DCamnloDatQpnrlo 
IlLDnOdnpicndlcL/Otlc 


nodmpicncllc 


U 


Q AAA 

o UUU 


\ 


1 C AAA 
\ UUU 


O 
c. 


OO AAA 
Oc. UUU 


3 


reserved 


4 


reserved 


5 


22 050 


6 


44 100 


7 


reserved 


8 


reserved 


9 


reserved 


10 


12 000 


11 


24 000 


12 


48 000 


13 


reserved 


14 


reserved 


15 


reserved 



If a valid frame is encountered in which the nLBRSampleRateCode sample rate differs from the nLBRSampleRateCode 
sample rate of the previous valid frame, it would indicate that a new stream has been encountered. In this case, the 
existing LBR decoder would need to be re-initialized with the new header values (including the new 
nLBRSampleRateCode sample rate) and decoding input and output buffers re-allocated to accommodate the new 
sample rate. 

The decoder parameter FreqRange is set according to nSampleRateCode according to Table 9-4. 



Table 9-4: FreqRange 



Range of Source Sampling Frequency 


FreqRange 


nSampleRate < 16 000 





1 6 000 < nSampleRate < 32 000 


1 


32 000 < nSampleRate < 50 000 


2 



usLBRSpkrMask (LBR speaker mask) 

This 16-bit value little-endian value describes the speaker mask for the LBR audio asset. usLBRSpkrMask follows the 
same convention as the nuSpkrActivityMask defined in Table 7-10. 

nLBRversion (LBR bitstream version) 

This 16-bit field represents the LBR bitstream version number, (distinct from the overall DTS-HD version). The version 
is represented in little-endian format as, 

OxMMmm , 

where: 

'MM' is the major version number expressed in the high order byte 
and 

'mm' is the minor revision number expressed in the low order byte. 

A decoder should reject LBR substreams whose LBR bitstream major version number does not match the decoder's 
LBR bitstream major version number. 
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nLBRCompressedFlags (flags for LBR decoder initialization) 

This 8-bit field is a compressed version of the LBR initialization flags. Table 9-5 shows the bit positions of the flags 
within nLBRCompressedFlags. 



Table 9-5: Parameter nLBRCompressedFlags 



Bit 7 
(MSB) 


6 


5 


4 


3 


2 


1 





Reserved 


Multichannel 
downmix: 
0=none 
1=present 


Stereo 
downmix: 

0=none 
1=present 


Bandlimit 
Flag MSB 


Bandlimit 
Flag 


Bandlimit 
Flag LSB 


LFE 

Channel: 
0=none 
1 =present 


Sample 

Size: 
0=16 bits 
1=24 bits 



The flag descriptions may be found in Table 9-6. 



Table 9-6: LBRFIags from bLBRCompressedFlags 



nLBRCompressedFlag 

s 


Corresponding LBR Flags 


Description 


0b00000001 


LBR_FLAG_24_BIT_SAMPLES 


If 1 : the input/output PCM audio samples are 24 
bits in length. Otherwise the samples are 16 bits 
in length. 


0b00000010 


LBR_FLAG_USE_LFE 


If 1 : LFE channel is present. 


obooomoo 


LBR_FLAG_BANDLMT_MASK 


Mask to isolate flags which describe bandlimit 
factors, which enhance audio quality with certain 
sample rate/bitrate combinations. 


0b001 00000 


LBR FLAG STEREO DOWNMIX 


If 1 : Stereo downmix is present within bitstream. 


0b01 000000 


LBR FLAG MULTICHANNEL DOWNMI 

X 


If 1 : Multichannel downmix is present within 
bitstream. 



The bandlimit flag bits indicate upsampling in the decoder is necessary to restore the original sampling frequency. The 
supported resampling ratios are shown in Table 9-7. 



Table 9-7: LBR band limit flags from nLBRCompressed Flags 



nLBRCompressedFlags 


Corresponding LBR Flag values 


Description 


0b00000100 


LBR FLAG BANDLMT FACTOR 2 3 


Limit bandwidth to 2/3 of original bandwidth. 


0b00001000 


LBR FLAG BANDLMT FACTOR 1 2 


Limit bandwidth to1/2 of original bandwidth. 


0D00001100 


LBR FLAG BANDLMT FACTOR 1 3 


Limit bandwidth to 1/3 of original bandwidth. 


0b00010000 


LBR FLAG BANDLMT FACTOR 1 4 


Limit bandwidth to 1/4 of original bandwidth. 


ObOOOHOOO 


LBR FLAG BANDLMT FACTOR 1 8 


Limit bandwidth to 1/8 of original bandwidth. 


0b00010100 


LBR FLAG BANDLMT FACTOR NONE 


Do not change bandwidth. 


all unused values 


Reserved for future use 





nLBRBitRateMSnybbles (most-significant nibbles of LBR stream original and scaled bitrates) 

The LBR bitstream header carries two bitrates: 

• the original bitrate; 

• the scaled bitrate which may be used when scaling has been performed on the LBR audio frame after it was 
encoded. 

Each bitrate is expressed as a 20-bit value. The 8-bit nLBRBitRateMSnybbles bitstream value carries the 
most-significant 4-bit nibble of the original bitrate and the most-significant 4-bit nibble of the scaled bitrate. 

The LBR original bitrate is determined by the following equation: 

nOriginalBitRate = nLBROriginalBitRate_LSW I ((nLBRBitRateMSnybbles & OxOF) « 16). 
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The LBR scaled bitrate is determined by the following equation: 

nScaledBitRate = nLBRScaledBitRate_LSW I ((nLBRBitRateMSnybbles & OxFO) « 12). 
nLBROriginalBitRate_LSW (least significant word of LBR original bitrate) 

The LBR original bitrate is expressed as a 20-bit value. 

nLBROriginalBitRate-LSW is a little-endian 16 bit field which contains the least-significant 16 bits of the encoded 
LBR stream's original bitrate. 

The LBR original bitrate is determined by the following equation: 

nOriginalBitRate = nLBROriginalBitRate_LSW I ((nLBRBitRateMSnybbles & OxOF) « 16). 
nLBRScaledBitRate_LSW (least significant word of LBR scaled bitrate) 

The LBR scaled bitrate is expressed as a 20-bit value. 

nLBRScaledBitRate-LSW is a little-endian 16 bit field which contains the least-significant 16 bits of the encoded LBR 
stream's scaled bitrate. 

The LBR scaled bitrate is determined by the following equation: 

nScaledBitRate = nLBRScaledBitRate_LSW I ((nLBRBitRateMSnybbles & OxFO) « 12). 



9.3 LBR Audio Data Organization 



The LBR audio payload is organized in a modular form we refer to here as a "chunk". All audio data is contained within 
a chunk or series of chunks. 

9.3.1 Chunks 

The chunk is the basic building block of the LBR bitstream. An LBR chunk is composed of three fields: the chunk ID, 
chunk length and chunk data. 



ID 



length 



data 



Figure 9-1 : Basic Chunk Description 

Organization of the LBR stream into chunks allows for: 

• simplification of adding new bitstream features; 

• forward compatibility. As new techniques and enhancements are developed, old decoders will still be able to 
play the frame portions that they are aware of; 

• scalable bitstream. Simplified post-encode bit-rate scaling of the LBR bitstream; 

• separation of coding components to implement robust interleaving and unequal error protection. 

Chunks may be nested. The higher level chunk is referred to as the "parent" of the chunks it contains. Likewise the 
chunks contained within a parent chunk are referred to as a "child chunk" of the parent. The LBR Frame Chunk, for 
example, is the highest level chunk in the LBR bitstream. 

9.3.1.1 Chunk ID 

The chunkID is used to identify the type of data stored in the chunk. The chunk ID is an 8-bit value providing a total 
of 256 unique chunk IDs. An extension ID is defined to allow for an additional 256 chunk types after the initial 256 
have been used. The most significant bit of the chunk ID is used to indicate whether the chunk is a short chunk 
(maximum data size of 255 bytes) or a standard chunk (maximum data size of 65 535 bytes). A zero bit indicates a short 
chunk while a one bit indicates a standard chunk. 
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Table 9-8 lists the binary values of the Chunk IDs that have been defined. 



Table 9-8: Chunk ID Table 



id value 


Name 


Description 


>/Aaa aaaa 

xOOO 0000 


» . . ii i r~\ 

null ID 


hli ill U i i a 1, 

Null Chunk 


w nAn aaah 

XOOO 0001 


pad ID 


Program Associated Data Chunk 


i,aaa a h a a 

XOOO 01 00 


i d d in 
LdH ID 


LBR Frame Chunk with checksum 


,,nnn a h h a 

XOOO 01 1 


LBK id no checksum 


LBR Frame Chunk without checksum 


..Ann h a j a 

xOOO 1 01 


■ i — i — Ar^in/~ i ik/i 1 1 — \ 

LFE ADPCM ID 


■ r~ I — /^l,,,„i, 

LFE Chunk 


>/AAA H A H H 

xOOO 1 01 1 


EmbLevels ID 


Embedded Channel Sets Chunk 


XOOO 1 1 00 


WmarklDI 


Reserved 


XOOO 1101 


WmarklDI 


Reserved 


wAAA h h h a 

XOOO 1110 


scalefactor ID 


Tonal Scale Factors Chunk 


,,nA j aaaa 

x001 0000 


+ i i r~» 

tonal ID 


Tonal Data Chunk (combined) 


x001 0001 


tonalGroup1_ID 


Tonal Data Chunk Group 1 


xOOl 0010 


4 A A A 1 w A I I m A I r*\ 

tonalbroup^lD 


Tonal Data Chunk Group 2 


wAAH A AH H 

X001 001 1 


4- A A A 1 V A 1 1 A O 1 1^ 

tonalbroupolD 


Tonal Data Chunk Group 3 


,,AA H AH A A 

xOOl 0100 


+ A A A 1 L» A I ■ *A A 1 

tonal(jroup4_llj 


Tonal Data Chunk Group 4 


»,A A H A H A H 

xOOl 0101 


tonalGroup5_ID 


Tonal Data Chunk Group 5 


v-AAH AH H A 
XUU1 U1 1 U 




Tonal Data Chunk (combined, scalefactors used) 


x001 0111 


tnlScf_Group1_ID 


Tonal Data Chunk Group 1 (scalefactors used) 


x001 1000 


tnlScf_Group2_ID 


Tonal Data Chunk Group 2 (scalefactors used) 


x001 1001 


tnlScf_Group3_ID 


Tonal Data Chunk Group 3 (scalefactors used) 


x001 1010 


tnlScf_Group4_ID 


Tonal Data Chunk Group 4 (scalefactors used) 


x001 1011 


tnlScf Group5 ID 


Tonal Data Chunk Group 5 (scalefactors used) 


x01 1 pppp 


gridl ID 


Residual Data Chunk. 1 st scalefactors grid 


x100 pppp 


hiGridsID 


Residual Data Chunk. High resolution scalefactor grids 


x1 01 pppp 


tsmpllD 


Residual Data Chunk. Timesamples, 1 st part 


x1 10 pppp 


tsmp2ID 


Residual Data Chunk. Timesamples, 2 nd part 


x1 1 1 1111 


extensionID 


Extension ID 



The most significant bit of the chunk ID (x) is used to indicate whether the chunk is a short chunk (0) or a standard (1) 
chunk, 'pppp' in residual chunk ID values is used to indicate channel pair number (0 to 15). 

9.3.1 .2 Extended ID Chunks 



To allow for an additional 256 chunk IDs, an extended ID mechanism is defined. When chunkID is equal to 
extensionID, the 8 bits following the Length field are the extended ID. 





Extended ID Chunk 




extensionID 


length 


extID 


data 



Figure 9-2: Extended ID Chunk 



9.3.1.3 Chunk Length 

The chunkLength indicates the length of the data portion of the chunk in bytes. The chunk length is 8-bits for short 
chunks and 16-bits for standard chunks, thus the maximum data size of a short and standard chunk is 255 and 65 535 
bytes respectively. 
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Table 9-9: ChunkLengthlnfo 



Syntax 



Size (Bits) 



ChunkLengthlnfo ( ) 
{ 

chunkID = ExtractBits (8) 

if( (chunkID & 0x80) == ) { 

chunkLength = ExtractBits ( 8 ) 

return (1) 

} 

else { 

chunkLength = ExtractBits ( 16 ) 
return (2) 

} 

} 



8 
8 

16 



If chunkID byte has the most-significant bit clear, the following 8 bits are an unsigned integer representing the chunk 
length. If chunkID byte has the most-significant bit set, the following 16 bits (most significant byte first) are an 
unsigned integer representing the chunk length. 

9.3.1.4 Data 

The data field contains the data of the chunk. The format of the data is specific to the type of chunk. 

9.3.1.5 Checksum Verification and Descrambling 

The checksum is the 16-bit sum of all the bytes in the frame except for two checksum bytes. The LBR Frame 
Chunk does not always carry all types of tabulated chunks. Some chunks may be missing due to bitstream fitting 
operations, transmission channel losses, or may be intentionally not coded during encoding. The decoder still should be 
able to recover remaining information from chunks present. 

If chunkID equals LBR_CID, a checksum has been computed and is stored in the bitstream in the 2 bytes following 
the chunk length. chunkHeaderBytelen will be 2 if ChunkLengthInfo() is 8 bits long, or 3 if ChunkLengthInfo() is 
16 bits long. 



Table 9-10: ChecksumVerify 



SYNTAX 


Size (Bits) 


ChecksumVerify (chunk) 
{ 

nChecksum = 

nHeaderLength = ChunkLengthlnfo {) +1 
nChunkLength -= 2 

// Get checksum stored in the bit stream (MSBF) 

nStoredChecksum = ExtractBits ( 16 ) 

// Calculate the checksum on the header 

for (i = nHeaderLength- 1 ; i >= 0; i--) 

nChecksum += chunk [i] 
// Calculate the checksum on the frame data 

// Note: the stored checksum is not included in the calcuation 
i = nHeaderLength + 2 
count = nHeaderLength + nDataLength 
for { ; i < count; i++) { 
nChecksum += chunk [i] 
bCheckSumSuccess = (nChecksum == nStoredChecksum) 
return bCheckSumSuccess ? SUCCESS : FAILURE 

} 


Table 9-9 
16 
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9.4 



LBR Frame Chunk 



The LBR bitstream is organized in audio frames, each representing a uniform time period. The number of audio 
samples generated from each frame is dependent on the sample rate of the decoded audio. The LBR Frame Chunk is the 
parent chunk to contain all coded audio and side information chunks for a given audio frame. 

The coded audio chunks within the frame are organized in a sequence. A chunk may be either an elemental component 
of the encoded audio or a collection of these elementary component chunks. 

Tonal chunks and residual chunks comprise the two main component chunk types. If a Low Frequency Effects (LFE) 
channel exists, it is coded using a special LFE chunk type. 

Both the tonal and residual chunks have multiple resolutions of components and the residual chunks are further coded in 
a multi-resolution grid structure. 

A typical LBR frame is shown in Figure 9-3. 



<0 

CD (D 
- 1 X 

W <D 

Q I 



2 



XL 
C 
ZJ 

o 



CD 













c 


c 


c 


c 


c 


zz 


zz 








.c 


sz 




sz 




o 


o 


o 


o 


a 


cc 


cc 


cc 


cc 


cc 


c 


c 


c 


c 


c 


o 


o 


o 


o 


o 


1- 


1- 


1- 


1- 


1- 



r " 

CO 
Q_ 



i: 



Frame chunk Data 



V) 
(0 


CM 

<n 
(0 


mesamp 
chunk 


mesamp 
chunk 


I- 


i- 



Figure 9-3: Example LBR Frame 



9.5 LBR Decoding 

The decoding process is illustrated in Figure 9-4. The input data for the decoding process consists of the input bitstream 
containing packed data frames and minimal side -band information consisting of: 

• Sample rate (Hz) 

• Number of channels 



• Bit rate of the input bitstream (bits/sec) 

Decoding can be done on a frame -by-frame basis or at a smaller subframe granularity. The entire decoding process can 
be divided into number of separate decoding sub-processes: 

• Bitstream parsing 

• Tonal part decoding 

• Residual decoding 

• LFE channel decoding (if LFE is present) 

The output of each decoding sub-process (other than bitstream parsing) is a sequence of time domain samples 
corresponding to the data in the current frame (or subframe). These output time-samples should be added together (no 
shift or alignment required) to obtain the complete decoded signal data. Note that unpacking each packed frame 
provides samples for the current frame plus additional samples which should be added to the results of the next frame 
that is decoded. 



Decoding of LFE channels is independent of the other channels. 
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Bitstream 
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Figure 9-4: LBR Decoder Overview 

The encoded data stream is composed of a sequence of individual frames. The number of samples resulting from 
decoding a frame depends on the sample rate of the audio signal and nominal values are shown in Table 9-11. 

Table 9-11 : Sample Rate to Frame Size Relationship 



Sample Rate 


Number of input samples 


< 16 kHz 


1 024 samples 


> 16 kHz but < 32 kHz 


2 048 samples 


> 32 kHz but < 50 kHz 


4 096 samples 



The LBR decodes each input frame as 16 smaller subframes. 

Table 9-12: Decode Frame 



Syntax 



Size (Bits) 



DecodeFrame ( ) { 

if (chunkID == LBR_ID) { 

if (FAILURE == ChecksumVerify (chunk) ) 
return 

} 

for (sf=0; sf < 16; sf++) 
DecodeSubFrame ( sf ) 



Table 9-10 



Table 9-13 
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Table 9-13: Decode SubFrame 



Syntax 



Size (Bits) 



(LBR_FLAG_STEREO_DOWNMIX | | 



(LBR FLAG USE LFE) 



DecodeSubFrame (nSubFrameCount ) 
InitializeParameters { ) 
if (bLBRCompressedFlags && 
LBR_FLAG_MULTI CHANNEL_DOWNMIX ) 
ECSChunk ( ) 
if (bLBRCompressedFlags && 

LFEChunk ( ) 
if (nSubFrameCount == 0) 
ResidualChunksPartl ( ) 
// GridlChunk 
// HiResGridChunk 
// TimeSampleslChunk 
// TimeSamples2Chunk 



// tonal vs residual shift is 11 subframes 
if (nSubFrameCount == 11) { 

ScaleFactorChunk ( ) 

Tonal Chunk ( ) 

} 

for (ch=0, ch < nPair, ch+=2) { 

ResidualChunksPart2 (ch, nSubFrameCount) 
// complete decoding of residual subbands 



} 

padChunk ( ) 
nullChunk ( ) 



Table 9-20 

Table 9-38 
Table 9-36 
Table 9-21 



Table 9-14 
Table 9-15 



Table 9-32 



Table 9-39 
Table 9-40 



9.5.1 Tonal Decoding 

The Tonal Chunks are used to store the tonal coding information of the LBR algorithm. Either a separate chunk is used 
for each different transform size (also called 'group'), or all transform sizes together with tonal scale factors are 
collected into one chunk. 



9.5.1 .1 Overview of Tonal Decoding 

The information for tonal decoding process consists of base functions divided into groups by length. Base functions are 
spread by time in the frame and only some of them have non-zero values in a given subframe. There are 5 tonal groups 
in total. The length of the base functions in each of the groups, measured in terms of subframes, is as follows: 

• 1 st group (nGroup=o) length is 2 subframes; 

• 2 nd group (nGroup=i) length is 4 subframes; 

• 3 rd group (nGroup=2) length is 8 subframes; 

• 4 th group (nGroup=3) length is 16 subframes; 

• 5 th group (nGroup=4) length is 32 subframes. 

The difference values obtained from the bitstream for the secondary audio channels (for stereo and multichannel 
signals) should be converted to absolute values. The amplitude component on the secondary channel is always lower 
than the amplitude of the primary channel. Quantized amplitudes should be converted to linear values using the 
quantized amplitude to linear amplitude conversion table in clause 9.8.1. 

Tonal decoding steps are as follows: 

• Initialize frequency domain subframe with zero values. 

• Unpack tonal scale factors. 

• Unpack tonal components and adjust each with the corresponding scale factor. 



ETSI 



156 



ETSI TS 102 114 V1. 4.1 (2012-09) 



• Synthesize and add required portions of l st -5 th group of base functions into same subframe. 

• Convert frequency domain subframe data into the time domain using an inverse MDCT followed by 
windowing. This step can be combined with residual reconstruction in the second stage of the hybrid 
filterbank. 

Note that the base function synthesis operations (step 4) can be performed in any order. 

9.5.1 .2 Tonal Scale Factors Chunk 

The Scale Factors Chunk is used to store scale factors for tonal components. 



Table 9-14: ScalefactorsChunkQ 



Syntax 


Size (Bits) 


ScaleFactorsChunk ( ) 
{ 

if (chunkID != scalef actor ID) { 

return 
ChunkLengthlnf o { ) 

for (scfBand=0; scfBand < 6; scfBand++) 

nScaleFactor [scf Band] = ExtractBits ( 6 ) 
ByteAlign { ) 

} 


8 

Table 9-9 

6 
0..7 



9.5.1 .2.1 Tonal scale factor processing 

At the encoder, the minimum amplitude out of all components within a correspondent frequency range is selected as its 
scale factor, so each component belonging to this frequency range is then adjusted with the scale factor before packing. 

At the decoder, it is not necessary to de-quantize the tonal scale factors; instead, the unpacked value should be added to 
the quantized amplitude of all tonal components within the correspondent frequency range before de -quantization. In 
the case when more than one channel having a component at the same frequency, amplitude of the channel with highest 
level should be treated as described above and the quantized amplitude difference of all secondary channels (channels at 
lower level) should be subtracted from treated maximum amplitude before quantization, see clause 9.5.1.3 for details. 

Correspondence between frequency and scalefactor index is described in Table 9-4 (see 'FreqToSf ). 

9.5.1.3 Tonal Chunks 

Two different versions of chunks exist, depending on whether scale factors are used. Those versions are identical from 
the bitstream point of view, except for the chunk ID field. All tonal chunks have the syntax shown below. 

When a separate chunk is used for each different transform size, or group: 



Table 9-15: TonalChunkQ 



Syntax 


Size (Bits) 


TonalChunk ( ) 
{ 

if (chunkID == tonalGroupX ID | | chunkID == tnlScf GroupX ID) { 

// where X is a digit between 1 and 5 which represents transform size 

ChunkLengthlnf o ( ) 

DecodeTonal (X-l) 

ByteAlign ( ) 

} } 


8 

Table 9-9 
Table 9-17 
0..7 
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When all transform sizes together with tonal scale factors collected into one chunk: 

Table 9-16: TonalChunkQ (with scalefactors) 



Syntax 



Size (Bits) 



TonalChunk { ) 
{ 

if (chunklD != tnlScf_ID) 

return 
ChunkLengthlnf o { ) 
for (i=0, i < 5; ++i) 
switch (nChunkID) 
{ 

default : 

case tnlScf_Group5_ID : 
case tonalGroup5_ID : 

nGroup = 
break ; 

case tnlScf_Group4_ID : 
case tonalGroup4_ID : 

nGroup = 1 
break ; 

case tnlScf_Group3_ID : 
case tonalGroup3_ID : 

nGroup = 2 
break ; 

case tnlScf_Group2_ID : 
case tonalGroup2_ID : 

nGroup = 3 
break ; 

case tnlScf_Groupl_ID : 
case tonalGroupl_ID : 

nGroup = 4 
break; 

} 

DecodeTonal (nGroup) 
ByteAlign { ) 



8 

Table 9-9 



Table 9-17 
0..7 



Code common to both types of tonal chunk types: 

Table 9-17: Decode Tonal 



Syntax 



Size (Bits) 



DecodeTonal (nGroup) 
{ 

nSubFrame = 1 
nFrequency = 1 
nFrequencyDif f = 
RemainingTonesForThisGroup = 1 
iterations = 1 

while (RemainingTonesForThisGroup) { 
while (iterations) { 

nFrequencyDif f = getVariableParam (prsDist [4 -nGroup] ) 
if (nFrequencyDif f>>2 > 0) { 

bitlength = nFrequencyDif f >> 2 

nFrequencyDif f = ExtractBits (bitlength) + f stAmp [nFrequencyDif f] 

} 

else{ 

nFrequencyDif f = f stAmp [nFrequencyDif f] 

} 

if (nFrequencyDif f > 1) { 
iterations = 
break 

} 

nFrequency = 1 

if (nFrequencyDif f = 0) 

nSubFrame += 1 
else 

nSubFrame += 8 



Table 9-18 



bitlength 
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Syntax 


Size (Bits) 


if (nSubFrame > (1 << nGroup) ) 

RemainingTonesForThisGroup = 

} 

nFrequency += nFrequencyDif f - 2 
if { nChannels > 1) 

mainChldx = ExtractBits (BitsForChNum [nChannels] - 1) 
else 

mainChldx = 

tonalAmplitudeMain = getVariableParam ( tnlScf ) + nScalef actors [FreqToSf [nFrequency 
>> (7-nGroup)]] 

tonalPhaseMain = ExtractBits ( 3 ) 
for (ch = 1; ch < nChannels; ch++) { 
if (ch != mainChldx) { 

chPresence = ExtractBits ( 1 ) 
if (chPresence) { 

nAmplitudeDif f = getVariableParam (dAmp) 
nPhaseDiff = getVariableParam (dPh) 

tonalAmplitudeSecondaryDif f = TonalAmplitudeMain - nAmplitudeDif f 
tonalPhaseSecondaryDif f = tonalPhaseMain - nPhaseDiff 

} } } 
nFrequency++ 

} } 


variable 

Table 9-18 
3 

1 

Table 9-18 
Table 9-18 



where: 



tables for prsDist[n], tnlScf, dAmp and dPh are defined in clause 9.8.10, 

fstAmp [44] = 
{ 



0, 


1, 


2 , 


4 , 


4, 


s, 


8, 


10 , 


12 , 


16, 


20, 


24, 


28, 


36, 


44, 


52, 


60, 


76, 


92 , 


108, 


124 , 


156, 


188, 


220, 


252 , 


316, 


380, 


444 , 


508, 


636, 


764 , 


892, 


1020, 


1276, 


1532 , 


1788, 


2044 , 


2556, 


3068, 


3580, 


4092, 


5116, 


6140, 


7164 



} 

FreqToSf [32] = 
{ 

0, 
1, 

2 , 2 , 

3 , 3 , 3 , 3 , 

4, 4, 4, 4, 4, 4, 4, 4, 

5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5, 5 

} 



BitsForChNum [10] = 

{ // channel number 



1, 


// 





1, 


// 


1 


2 , 


// 


2 


2 , 


// 


3 


3, 


// 


4 


3, 


// 


5 


3, 


// 


6 


3, 


// 


7 


4 , 


// 


8 


4 


// 


9 
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Table 9-18: getVariableParamQ 



syntax 


Size (Bits) 


getVariableParam (table) 






{ 

index = 1 
nbits = 

while (table [index] .A != OxFF) { 

Nvalue = table [index] . ExtractBits (1) 
index += Nvalue 

} 

Nvalue = table [index] .B 

if (nValue = 0) { 

nbits = ExtractBits (3) +4 
return (ExtractBits (nbits ) ) 

} 

else 

return (Nvalue -1) 

} 


//O == A, 1 == B 


1 




3 

nbits 







GetVariableParam ( ) is used by both Tonal and Residual functions to fetch variable bit length parameters from the 
bitstream. The table entries each have a sub-index referring to column A (0) or column B (1) and result in a table walk 
or "index hopping" to arrive at the correct offset in statistically the fewest number of bits. 

The tables passed into this function are all in clause 9.8.10. The resulting index is used to locate a base value in 
FstAmp [] which also determines the number of additional bits required to extract from the bitstream to code the residual 
value. 

9.5.1 .3.1 Tonal components processing 

In the bit stream, the components are packed in ascending frequency order and each active bin position may contain 
component from one or more channels. In the multi-channel case, the channel with the highest amplitude would appear 
first, coded using absolute amplitude (with scale factor adjustment) and phase. All secondary channels at lower levels 
are then difference-encoded against the maximum channel. 

Bitstream parsing provides the following values (see clause 9.3): 

• quantized amplitude (LogAmplitude); 

• quantized phase (Phase); 

• spectral line number (Freq); 

• position of base function in the frame; 

• per-channel 'presence' of the component - the bit map of the channels where component has non-zero 
amplitude. 

After the scale factor/maximum channel adjustment, quantized amplitudes should be converted to linear values using 
the quantized amplitude to linear conversion table in clause 9.8.1. 

Frequency domain subframe data consists of a number of MDCT spectral lines. The number of lines (N) used depends 
on the sample rate according to Table 9-19. 



Table 9-19: Subframe Resolution 



Sample rate 


Number of spectral lines in subframe 


< 16 kHz 


64 


> 16 kHz but < 32 kHz 


128 


> 32 kHz but < 50 kHz 


256 



NOTE: This frequency domain data results in 2 x N time domain samples. The first half of these samples should 
be added to the last half from the previously decoded subframe (which is automatically done by hybrid 
filterbank). The resulting N samples are the output of the tonal decoding process. 
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9.5.1.3.2 



Base-functions synthesis 



The Base function is a finite-length time -domain function used in tonal encoding and decoding. The base function is 
described by the formula: 

l-cos(— ln 
F(t;A,l,f,<p) = A s ' m (~' f ' t + ( P^ where fe[0,Z) 

F(t;A,l,f,<p)=0; 



where: 




A 


amplitude; 


t 


time variable ( t e N ); 


<P 


phase; 


1 


function length, power of 2 (=128..8192); 


f 


frequency / e [1, ^) 



Real functions used in the algorithm are approximations of this function and described by fast-synthesis algorithm 
below. This formula is also used to create the table 'wavSynEnvelope[]' which is used to shape the tonal components 
(base functions) in tonal coding. 

Tonal base functions are spread over a number of subframes. The output of synthesizing these base functions is applied 
to all subframes where the synthesized base function has non-zero values. The synthesis is done by updating Amplitude 
and Phase values according to the frequency and the length of the base function. A number of neighbour spectral lines 
are also modified by 'CorrCf[][]' to reduce distortions produced by this synthesis method. Pseudo-code below illustrates 
the process: 



ph0_shift[8] 
xFreq = Freq >> 
F_dlt = Freq & 
PhaseRotation = 
CurPhase = 128 



-96, 



1-32, + 96, 



-32, +32, 



-96 



{-32, +96, 
(nGroup + 1) ; 
(1<< (nGroup+1) -1) << ( 4 -nGroup ) ; 
256 - ((F_dlt + (xFreq&l) *32) *4) ; 
- 64*Phase - { (PhaseRotation << nGroup+1) 



ph0_shift [ (xFreq&3) *2 + ((Freq &1))] + (xFreq: 
for (index = 0; index < Length; index = index+1) { 



PhaseRotation) + 
•>D ; 



CurAmplitude = Amplitude 
WavSyn [index] [xFreq-5] += 
WavSyn [index] [xFreq-4] += 
WavSyn [index] [xFreq-3] += 
WavSyn [index] [xFreq-2] += 
WavSyn [index] [xFreq-1] += 
WavSyn [index] [xFreq] += 
WavSyn [index] [xFreq+1] 
WavSyn [index] [xFreq+2] 
WavSyn [index] [xFreq+3] 
WavSyn [index] [xFreq+4] 
WavSyn [index] [xFreq+5] 



+= 
+= 
+= 

CurPhase += PhaseRotation; 



wavSynEnvelope [index] ; 
CurAmplitude*CorrCf [F_dlt] [0] 
CurAmplitude*CorrCf [F_dlt] [1] 
CurAmplitude*CorrCf [F_dlt] [2] 
CurAmplitude*CorrCf [F_dlt] [3] 
CurAmplitude*CorrCf [F_dlt] [4] 
CurAmplitude*CorrCf [F_dlt] [5] 
CurAmplitude*CorrCf [F_dlt] [6] 
CurAmplitude*CorrCf [F_dlt] [7] 
CurAmplitude*CorrCf [F_dlt] [8] 
CurAmplitude*CorrCf [F_dlt] [9] 
CurAmplitude*CorrCf [F_dlt] [10 



sin (2*Pi*CurPhase/256 
sin (2*Pi*CurPhase/256 
sin (2*Pi*CurPhase/256 
sin (2*Pi*CurPhase/256 
sin (2*Pi*CurPhase/2 5 6 
sin (2*Pi*CurPhase/256) 
sin (2*Pi*CurPhase/256 
sin (2*Pi*CurPhase/2 5 6 
sin (2*Pi*CurPhase/2 5 6 
sin (2*Pi*CurPhase/2 5 6 
]* sin (2*Pi*CurPhase/256 



5*Pi/2) 
4*Pi/2) 
3*Pi/2) 
2*Pi/2) 
l*Pi/2) 

l*Pi/2) 
2*Pi/2) 
3*Pi/2) 
4*Pi/2) 
5*Pi/2) 



where: 
WavSyn 
Freq 

Amplitude 



spectral line of MDCT 

values obtained from the bitstream (see Table 9-17) 

linear amplitude of base function, derived from LogAmplitude by using 
the table in clause 9.8.1 



wavsynEnvelope [index] envelope values obtained from the table in clause 9.8.2 
nGroup group number (0 for shortest base functions, 4 for longest) 

Length length of base function (Length = 1 « nGroup+ 1) 
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CorrCf[F_dlt][ ] correction coefficient given by the table in clause 9.8.3 (corresponding table 

should be used for each given group) 

F_dlt difference between original frequency and center frequency of the spectral 

line used for synthesis 

As the synthesis is done on a subframe basis only one iteration of the cycle should be done and CurPhase and 
CurAmplitude should be stored for synthesizing the next subframe. 

If the spectral line number referred to by the algorithm appears to be negative (this occurs when xFreq is to 4 and the 
calculation refers to the terms [xFreq-5] to [xFreq-1]), a mirroring effect takes place. The mirrored spectral 
line (the number of this spectral line is the absolute value of the negative spectral line number) should be corrected 
using the negative of the complex tabulated coefficient. 

9.5.2 Residual Decoding 

Residual decoding is a scalable scheme. Each residual frame produces the same number of samples as one LBR frame. 

9.5.2.1 Residual Decoding Overview 

At the encoder, the residual samples of each primary channel are processed by filters producing 32 uniform frequency 
subbands, each containing 128 time samples. From the subband samples, scale factors are calculated to normalize the 
samples before quantization. 

There are potentially 4 different residual coding chunks, in each LBR frame: 

• Gridl chunk - contains low resolution grid of scale factors. 

• High resolution grids chunks (two types) - contains high-time resolution grid of scale 
factors and high-frequency resolution grid of scale factors (Grid2 and Grid3). 

• Timesamples chunks - time sample information for each of the filterbank subbands. 
Residual decoding involves the following steps: 

1) Unpack and decode Gridl, Grid2, Grid3 scale factors, then use them to reconstruct the high resolution 
(Hi Res) scale factor grid. 

2) Unpack LPC coefficients and use them with the prediction error unpacked in step 3 to synthesize the subband 
samples. 

3) Unpack LPC errors and time samples using quantizers specified by the quantization profile. 

4) Rescale with corresponding Hi Res scale factors. 

5) Reconstruct residual time samples with inverse filterbank. 
These steps are further described in the following clauses. 
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9.5.2.2 Unpacking and Decoding Residuals 

Table 9-20: Residual parameter initialization 



Syntax 



InitializeParameters ( ) 
{ 

FrequencyRange = SAMPLERATE_TO_FREQRANGE (nSampleRate ) 
nTotalSubbands = 8 << (FrequencyRange) 
nBaseBitRate = (500+250 * nChannels) 
if (nBitRate >= 176 * nBaseBitRate) 

nResProfile = 2 
else if (nBitRate >= 100 * nBaseBitRate) 

nResProfile = 1 
else 

nResProfile = 

G3AvgOnlyStartSB = nTotalSubbands * 1000 * Prof iles [nResProfile] . nAvgGrid3 Frequency / 
(nSampleRate/2 ) 

if (G3AvgOnlyStartSB > nTotalSubbands) 

G3AvgOnlyStartSB = nTotalSubbands 
MinMonoSubband = nTotalSubbands * 1000 * Prof iles [nResProfile] . nMinMonoFrequency / 
(nSampleRate/2 ) 

if (MinMonoSubband > nTotalSubbands) 

MinMonoSubband = nTotalSubbands 
MaxMonoSubband = nTotalSubbands * 1000 * 14 / (nSampleRate/2 ) 
if (MaxMonoSubband > nTotalSubbands) 

MaxMonoSubband = nTotalSubbands 
SecChPres = 
TSCodingMethod = 



where: 

Profiles!] = {nMinMonoFrequency, nAvgGrid3 Frequency} 

Profiles [3] = 

{ 

{ 2, 16 }, 
{ 2, 18 }, 
{ 2, 24 } 

} 

S AMPLER ATE_TO_FREQRANGE() is defined by Table 9-4 and other persistent parameters are defined in 
clause 9.1.1. 



Table 9-21 : Residual Chunks Part 1 



syntax 


Size (Bits) 


ResidualChunksPartl ( ) { 

for (ch=0, ch < nPair, ch+=2) { 

// the last pair of channel is mono 

// when the number of fullband channel is odd 

if ((ch+2) = nChannels) { 

oneChPair = TRUE 
}else{ 

oneChPair = FALSE 

} 

GridlChunk (ch, oneChPair) 
HiResGridChunk (ch, oneChPair) 
TimeSampleslChunk (ch, oneChPair) 
TimeSamples2Chunk (ch, oneChPair) 

} } 


Table 9-23 
Table 9-24 
Table 9-22 
Table 9-25 
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Table 9-22: TimeSamples 1 Chunk 



syntax 


Size (Bits) 


TimeSamples IChunk (ch, oneChPair) { 
DecodeLPC(2, 3) 

DecodeTS(ch, 2, 4, 0, oneChPair) 

DecodeGrid2 (0, 1) 

DecodeTS(ch, 4, 6, 0, oneChPair) 

} 


Table 9-31 
Table 9-28 

Table 9-28 



Table 9-23: GricM Chunk 



syntax 



GridlChunk (ch, oneChPair) { 

// decode scale factors nSubbandScf Gridl 

nTotalSubbandsGl = Scalef actorToGridl [nTotalSubbands-1] + 
for (nSubband = 2; nSubband < nTotalSubbandsGl; nSubband+- 
DecodeScaleFactors { nSubbandScf Gridl [ch] [nSubband] [] ) 



■){ 



Table 9-29 



// decode average values for 3rd grid 
if (nTotalSubbands > 4) 

nTotalSubbandsG3 = nTotalSubbands -4 + 1 
else 

nTotalSubbandsG3 = 1 
for (nSubband = 0; nSubband < nTotalSubbandsG3 ; nSubband+ + ) { 
nSubbandAvgGrid3 [ch] [nSubband] = getVariableParam ( avgG3 ) 
nSubbandAvgGrid3 [ch] [nSubband] -= 16 
if ( oneChPair=FALSE ) { // "stereo" pair 
if ( (nSubband+4) >= 2 ) { 
// copy primary to secondary 

nSubbandAvgGrid3 [ch+1] [nSubband] = nSubbandAvgGrid3 [ch] [nSubband] 

} 

else{ // read data from bitstream for the 2 first subbands 

nSubbandAvgGrid3 [ch+1] [nSubband] = getVariableParam ( avgG3 ) ; 
nSubbandAvgGrid3 [ch+1] [nSubband] -= 16; 

} } } 

// get stereo image for partial mono mode 
if (nBitsLeft >= 8) { 

if ( oneChPair=FALSE ) { 

nPartialStereoSubband = 

nMin[0] = ExtractBits (4 ) ; 

nMin[l] = ExtractBits (4 ) ; 

for (nSubband = 2, nSubband < nTotalSubbands; nSubband += 4) { 
for (nCh = ch; nCh <= (ch+1) ; nCh++) { 

for (nSubSample = 3; nSubSample >=0; nSubSample- - ) { 

nSubbandPartStereo [nCh] [nPartialStereoSubband] [nSubSample] = 
getVariableParam ( stGrid) + nMin [nCh-ch] 
} } 

nPartialStereoSubband++ 

} } } 

// get low-resolution spatial information 
for (nCh = 2; nCh < nChannels; nCh++) { 

nMin[0] = ExtractBits (4 ) 

nPartialStereoSubband = 

for (nSubband = 0, nSubband < nTotalSubbands; nSubband += 4){ 
for (nSubSample = 3; nSubSample >=0; nSubSample- -) { 

nSubbandPartStereo [nCh] [nPartialStereoSubband] [nSubSample] = 
getVariableParam ( stGrid) + nMin [nCh-ch] 
} 

nPartialStereoSubband++ 

} } } 



Table 9-18 



Table 9-18 



Table 9-18 



Table 9-18 



where 

stGrid and avgG3 are defined in clause 9.8.10. 
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Table 9-24: High Resolution Grid Chunk 


syntax 


Size (Bits) 


HiResGridChunk (ch, oneChPair) { 

// quantizer profile is read from bitstream 
nQuantizerProf ile = ExtractBits ( 8 ) 

// calculate quantization levels for each subband 
OL = (nQuantizerProf ile & 0x38) >> 3 // OL: overall level 
ST = (nQuantizerProf ile & OxCO) >> 6 // ST: steepness 
maxSB = nQuantizerProf ile & 0x07 
// calculate levels according to a formula 
for (sb=0; sb<nTotalSubbands ; ++sb) { 
F = sb*nSampleRate/nTotalSubbands 
A = 18000/ (12*F/1000+100+40*ST) + 20*OL 
// translate dB into quantization level indices 
if (A <= 95) 

QLevels [sb] =1 // 1.0 bits 
else if (A <= 140) 

QLevels [sb] =2 // 1.6 bits 
else if (A <= 180) 

QLevels [sb] =3 // 2.4 bits 
else if (A <= 230) 

QLevels [sb] =4 // 3 . bits 
else 

QLevels [sb] =5 // 4.0 bits 

// reorder quantization levels for lower subbands according to maxSB 

QLevels [maxSB] = QLevels [0] 

// get LPC for the first two subbands 

DecodeLPC(0, 2) 

// get time-samples for the first two subbands of main channel 
DecodeTS(ch, 0, 2, 0, oneChPair) 

// get the first two bands of the first grid from bitstream 
for (nSubband = 0; nSubband < 2; nSubband++) 


8 


Table 9-31 
Table 9-28 


{ 

for (nCh = ch; nCh <= (ch+1) ; nCh++) { 

DecodeScaleFactors ( nSubbandScf Gridl [nCh] [nSubband] [] ) 

} } } 


Table 9-29 


Table 9-25: TimeSamples 2 Chunk 


syntax 


Size (Bits) 


TimeSamples2Chunk ( ch, oneChPair) { 
DecodeGrid2 (1, 4) 

DecodeTS(ch, 6, nMaxMonoSubband, 0, oneChPair) 
if ( oneChPair = FALSE ) { 

DecodeGridl (ch) 

DecodeGrid2 (ch, 0, 4) 

} 

DecodeTS(ch, nMinMonoSubband, nTotalSubbands , 1, oneChPair) 

} 


Table 9-28 


Table 9-28 


Table 9-26: Decode Gridl 


syntax 


Size (Bits) 


DecodeGridl (ch) { 

nTotalSubbandsGl = Scalef actorToGridl [nTotalSubbands-1] + 1 
for (nSubband = 2; nSubband < nTotalSubbandsGl; nSubband++) { 
DecodeScaleFactors ( nSubbandScf Gridl [ch+1] [nSubband] [] ) 

} } 


Table 9-29 
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where: 

Scalef actorToGridl [64] = 
{ 

0,1,2,3,4,4,5,5,6,6,6,6,7,7,7,7,8,8,8,8,8,8,9,9,9,9,10,10,10,10,10,10, 
11, 11, 11, 11, 11, 11, 11, 11, 11, 11, 11, 11, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 12, 
12, 12, 12, 12, 12, 12, 12, 12 

} 



Table 9-27: Decode Grid2 



syntax 



Size (Bits) 



DecodeGrid2 (ch, startSB, endSB) { 

nTotalSubbands = Scalef actorToGrid2 [nTotalSubbands-1] + 1 
if (endSB > nTotalSubbands) 

endSB = nTotalSubbands 
for (nSubband = startSB; nSubband < endSB; nSubband++) { 
for (nCh = ch; nCh <= (ch+1) ; nCh++) { 
for (i = 63; i >= 0; i -= 8) { 
nValue = ExtractBits ( 1 ) 
if (nValue) { 

for (j = 7; j >= 0; j--) { 
nValue = ExtractBits ( 5 ) 
r = Grid2Codes [nValue] 
r &= 31 
if (r<31) 

nSubbandScfGrid2 [nCh] [nSubband] [j] = r 
if ( ExtractBits (1) ) { 

nSubbandScfGrid2 [nCh] [nSubband] [j] =6 
break 

if ( ExtractBits (1) ) { 

nSubbandScfGrid2 [nCh] [nSubband] [j] =7 
break 

} 

if ( ExtractBits (1) ) { 

nSubbandScfGrid2 [nCh] [nSubband] [j] =8 
break 

} 

if ( ExtractBits (1) ) { 

nSubbandScf Grid2 [nCh] [nSubband] [j] =9 
break 

} 

if ( ExtractBits (3) = 3 ){ 

nSubbandScfGrid2 [nCh] [nSubband] [j] = Oxa 
break 

} 

if ( ExtractBits (3) = 3 ){ 

nSubbandScfGrid2 [nCh] [nSubband] [j] = Oxb 
break 

} 

if ( ExtractBits (7) = 2 ){ 

nSubbandScfGrid2 [nCh] [nSubband] [j] = Oxc 
break 

} 

if ( ExtractBits (7) = 6 ){ 

nSubbandScfGrid2 [nCh] [nSubband] [j] = 0x10 
break 

} 

if( ExtractBits (15) = 5 ){ 

nSubbandScfGrid2 [nCh] [nSubband] [j] = Oxe 
break 

} 

if ( ExtractBits (15) = 9 ){ 

nSubbandScfGrid2 [nCh] [nSubband] [j] = 0x11 
break 

} 

if ( ExtractBits (31) = 1 ){ 

nSubbandScf Grid2 [nCh] [nSubband] [j] = 0x12 
break 



15 



15 



31 
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syntax 


Size (Bits) 


i 

if ( ExtractBits (31) = 17 ) { 

nSubbandScfGrid2 [nCh] [nSubband] [j] = 
break 

} 

if ( ExtractBits (31) = Oxd ){ 

nSubbandScf Grid2 [nCh] [nSubband] [j] = 


Oxd 


O 1 


Oxf 


31 


} 

/ / rare value case 
nBits = ExtractBits (3) + 4 
r = ExtractBits (nBits) 
if (r > 5G) 

nSubbandScfGrid2 [nCh] [nSubband] [j] = 
nSubbandScfGrid2 [nCh] [nSubband] [j] = r 

} 





3 
nbits 


}else{ 

for (j = 7; j >= 0; j--) { 

nSubbandScfGrid2 [nCh] [nSubband] [j] =0 

}}}}}} 







where: 

Scalef actorToGrid2 [64] = 
{ 



0, 


0, 


0, 


0, 


0, 


0, 


0, 


0, 


0, 


0, 


1, 


1, 


1, 


1, 


1, 


1, 


1, 


1, 


2, 


2, 


2, 


2, 


2, 


2, 


2, 


2, 


2, 


2, 


2, 


2, 


2, 


2, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3, 


3 



} 



Grid2Codes [32] = 



66, 
66, 



99, 
99, 



65, 
65, 



64, 
64, 



66, 
66, 



132 , 
132 , 



65, 
65, 



64, 
64, 



66, 
66, 



99, 
99, 



65, 
65, 



64, 
64, 



66, 
66, 



165, 65, 64, 
191, 65, 64 



Table 9-28: DecodeTS 



syntax 



Size (Bits) 



DecodeTS (ch, startSB, endSB, channelFlag, oneChPair) { 

for (nSubbandlndex = startSB; nSubbandlndex < endSB ; nSubbandIndex++ ) { 
nSbReordered = 
if ( nSubbandlndex > 6 ) { 

if ( (channelFlag=0 ) | (nSubbandlndex >= nMaxMonoSubband) ) 
nSbReordered = ExtractBits ( 5 ) 

}else{ 

nSbReordered = nSubbandlndex 

} 

// get grid3 scf 

if (nSubbandlndex = 12) { 

if (G3AvgOnlyStartSB > 4) 

nEndSubbandG3 = G3AvgOnlyStartSB-4 

else 

nEndSubbandG3 = 
for (nSubbandG3 = 0; nSubbandG3 < nEndSubbandG3 ; nSubbandG3++) 
{ 

for (nCh = ch; nCh <= (ch+1) ; nCh++) 

nSubbandScfGrid3 [nCh] [nSubbandlndex] [nSubbandG3] = DecodeGrid3 ( ) 



Table 9-30 



}else if ( (nSbReordered 
if (nSbReordered > 4) 



& (nSubbandlndex < 12) 
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syntax 



nSubbandG3 = nSbReordered-4 
else 

nSubbandG3 = 
for (nCh = ch; nCh <= (ch+1) ; nCh++) { 

nSubbandScfGrid3 [nCh] [nSubbandlndex] [nSubbandG3] = DecodeGrid3 ( ) 

} } 

// stereo matrix 
if ( oneChPair=FALSE ) { 
if (nBitsLeft >= 20) 

nSubbandMidSide [nSbReordered] = ExtractBits ( 8 ) 
if ( ( channelFlag = 1 ) & ( nSbReordered >= nMinMonoSubband) ) 
nLRMS = ExtractBits (8) 

} 

nStartChannel = ch 
nEndChannel = ch+2 

if { (nSbReordered >= nMinMonoSubband) & (nSubbandlndex < nMaxMonoSubband) ) 
{ 

if (channelFlag = 0) 

nEndChannel = ch+1 
else 

nStartChannel = ch+1 

} 

for (nCh = nStartChannel; nCh < nEndChannel; nCh++) { 
overCoded = 
sbCodingFetched = 

for (nSubFrame = 0; nSubFrame < 4; nSubFrame++) { 

// timesample coding method is fetched from the bitstream only once 
if ( ! sbCodingFetched) { 

TSCodingMethod [nCh] [nSbReordered] [nSubFrame] = ExtractBits ( 1 ) 

sbCodingFetched = 1 

} 

switch (QLevels [nSubbandlndex] ) { 
case 1 : 

nBitsToExtract = min (nBitsLeft , 32) 
break 
case 2 : 

if (TSCodingMethod [nCh] [nSbReordered] [nSubFrame] == 0) { 

// in the first method, 5 samples are combined to form an 
// 8-bit number (3^5 = 243 which fits into 8 bits) 
if ( (32-overCoded) % 5 ) 

nBitsToExtract = min (nBitsLeft , 8 * (( 32 -overCoded) / 5 + 1) ) 
else 

nBitsToExtract = min (nBitsLeft , 8 * (( 32 -overCoded) / 5 )) 
overCoded = max(0, 5*nBitsToExtract/8 - ( 32 -overCoded) ) 
}else{ 
i = 

j = nBitPosition 
nValue = ExtractBits ( j ) 
while (j > 0) { 

tmv = (nValue & 1) +1 

nValue = nValue >> tmv 

j -= tmv 

i + + 

} 

nBits = nBitsLeft 
for (i=0;i < 1G & (nBits >= 

nValue = ExtractBits ( 1 ) 

nValue |= ExtractBits ( 1 ) 

if (j < 0) 

nValue = nValue >> 1 

j += 16 

while ( j > 0) { 

tmv = (nValue & 1) +1 
nValue = nValue >> tmv 
j -= tmv 
i++ 



Size (Bits) 

Table 9-30 

8 
8 



variable 



16) ; nBits 



16) { 



} } 



nBitsLeft 



nBits 
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syntax 

if (j < 0) 

ExtractBits (1) 
for (i=0;i < 32;i++){ 
if (nBitsLeft >= 10) { 

for (j = 0; j < 5; j++) { 
if (i + j >= 32) 

break 
if (ExtractBits (1) ) 
ExtractBits (1) 

} } 
i += j 

} } 
break 
case 3 : 

if ( (32-overCoded) % 3 ) 

nBitsToExtract = min (nBitsLeft , 7 * ( 32 -overCoded) / 3 + 1) 
else 

nBitsToExtract = min (nBitsLeft , 7 * ( 32 -overCoded) / 3 ) 

overCoded = max(0, 3*nBitsToExtract/7 - ( 32 -overCoded) ) 
break 
case 4 : 

for (i = 0; i < 32; i++) 

nCodeVal [ExtractBits (8) & £3] 



break 
case 5 : 

nBitsToExtract = min (nBitsLeft , 4 * 32) 
break 



} 



if (nBitsToExtract) { 

if (nBitsToExtract >= 16) { 

nBitsLeft -= ( (nBitsToExtract >> 3) - 2) << 3 
ExtractBits (IS) 

ExtractBits ( nBitsToExtract & 7 ) 
}else{ 

ExtractBits (nBitsToExtract) 

} } } } 

if (loneChPair & (nSubbandlndex >= nMinMonoSubband) & (nBitsLeft >= 20) 
SecChPres = 1 



} } 



where: 

nCodeVal [64] = 
{ 

1, 2, 4, 3, 5, 2, 0, 3, 1, 2, 4, 3, 6, 2, 0, 3, 1, 2, 4, 3, 5, 2, 0, 3, 1, 2, 4, 3, 7, 2, 0, 3, 
1, 2, 4, 3, 5, 2, 0, 3, 1, 2, 4, 3, 6, 2, 0, 3, 1, 2, 4, 3, 5, 2, 0, 3, 1, 2, 4, 3, 4, 2, 0, 3 



Table 9-29: Decode Scalefactors 



syntax 



Size (Bits) 



DecodeScaleFactors ( *nSubbandScf ) { 
subframe = 

nPrevious = getVariableParam ( f stRsdAmp) 

nSubbandScf [subframe] = nPrevious 

while (subframe != 7) { 

// get subframe distance to next interpolation point 
nextSubf rameDistance = getVariableParam (rsdAppx) + 1 
// get value of next interpolation point 
nNext = getVariableParam (rsdAmp) 
if (nNext & 1) 

nNext = nPrevious + ( (nNext +1) >> 1) 
else 

nNext = nPrevious - (nNext >> 1) 
// perform linear interpolation on missing values 



Table 9-18 

Table 9-18 
Table 9-18 
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syntax 


Size (Bits) 


if (nextSubf rameDistance = 2){ 














if (nNext > nPrevious) 














nSubbandScf [subframe + 1] 


= 


nPrevious + 


(nNext - nPrevious) 


>> 


1 


















nSubbandScf [subframe + 1] 


= 


nPrevious - 


(nPrevious - nNext) 


>> 


1 




} 

else if (nextSubf rameDistance = 


4 


{ 










if (nNext > nPrevious) { 














IlO LikJkJGLLl.\J.iD J_ L b LLJJi- X. CtLLLC T ± J 




ilrlCViULlb T 


V 1 UN GAL ilL7j.CVXUU.t3j 




2 




nSubbandScf [subframe + 2] 




nPrevious + 


(nNext - nPrevious) 


>> 


1 




nSubbandScf [subframe + 3] 

} 




nPrevious + 


( (nNext - nPrevious 


* 


3) >> 2 




else { 














nSubbandScf [subframe + 1] 




nPrevious - 


(nPrevious — nNext) 




2 




nSubbandScf [subframe + 2] 




nPrevious - 


(nPrevious - nNext) 


>> 


i 




nSubbandScf [subframe + 3] 

} } 




nPrevious - 


( (nPrevious - nNext 


* 


3) >> 2 




else { 














for (i = 1; i < nextSubf rameDistance; i++) { 










nSubbandScf [subframe + i] 




nPrevious + 


(nNext - nPrevious 








* i / nextSubf rameDistance 

i i 










; ; 

// final interpolation point 














nSubbandScf [subframe + nextSubf rameDistance] = 


nNext 








// move on to next subframe 














subframe += nextSubf rameDistance 












// update 














nPrevious = nNext 

} } 















where: 

fstRsdAmp, rsdAppx and rsdAmp are defined in clause 9.8.10. 



Table 9-30: Decode Grid3 



syntax 


Size (Bits) 


DecodeGrid3 ( ) { 






for (i = 0; i < 8; i++ 


){ 




nValue = ExtractBit 


s(5) 


5 


r = Grid3Codes [nVal 


ue] 




r &= 31 






if (r<31) 






return r 






if ( ExtractBits (1) 


= ) 


1 


return Oxd 






if( ExtractBits (3) 


= 3 ) 


3 


return 0x13 






if ( ExtractBits (7) 


= 1 ) 


7 


return Oxc 






if( ExtractBits (1) 


= 1 ) 


1 


return 0x14 






if ( ExtractBits (3) 


= ) 


3 


return Oxb 






if( ExtractBits (7) 


= 6 ) 


7 


return 0x15 






if( ExtractBits (15) 


= 2 ) 


15 


return Oxa 






/ / rare value 






nBits = ExtractBits 


(3) + 4 


3 


r = ExtractBits (nBi 


ts) 


nbits 


if (r > 56) 






return 16 






return r 






} } 
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where: 

Grid3Codes [32] = 



113, 48, 
113, 48, 



79, 
79, 



142 , 
142 , 



48, 
48, 



79, 48,113, 48, 
79, 48,113, 48, 



79, 
79, 



178, 48, 
191, 48, 



79, 4E 
79, 4£ 



Table 9-31 : DecodeLPC 



syntax 



Size (Bits) 



DecodeLPC (startSB, endSB) { 
nOrder = 8 

for (nSubband = startSB; nSubband < endSB; nSubband++) { 
for (nCh = ch; nCh <= (ch+1) ; nCh++) { 
if (nSubband < 2){ // LPC subbands 

for (j = 0; j < 2; j++){ // LPC frames 
if (nBitsLeft >= nOrder * 4) { 
for (i = 0; i < nOrder; i++) { 
LpcQtd[i] = ExtractBits (4) 

} } } } 
else{ 

if (nBitsLeft >= nOrder * 4) 
{ 

for (i = 0; i < nOrder; i++) { 
LpcQtd[i] = ExtractBits (4) 

}}}}}} 



Table 9-32: Decode Residual Chunks Part 2 



syntax 



Size (Bits) 



ResidualChunksPart2 (ch, nSubFrameCount ) 
{ 

nSampleOf f set = nSubFrameCount << 3 
nTotalSubbands = 8 << nFrequencyRange 
nSubFrame4 = nSubFrameCount/4 

for (nSubband = 0; nSubband < nTotalSubbands ; nSubband++) { 
if ((nSubband >= nMinMonoSubband) & (SecChPres = 0)) 

nCodedChannels = ch 
else 

nCodedChannels = ch+1 
for (nCh = ch; nCh <= nCodedChannels; nCh++) { 
nLevel = QLevels [nCh] [nSubband] [nSubFrame4] 
ModuloOrigin = 
if (nLevel = 2) { 

if (nBitsLeft >= 16) { 

if (TSCodingMethod [nCh] [nSubband] [nSubFrame4] = 0){ 
nValue = nPackedSamples [nCh] [nSubband] 
for (j = 0; j < 8; j++) { 

nCount = mod (5, nSampleOf f set - ModuloOrigin + j) 
if (nCount=0) { 

if (nBitsLeft >= 24) 

nValue = ExtractBits ( 8 ) 
else if (nBitsLeft >= 8) { 
nValue = ExtractBits ( 8 ) 
if (nValue > 242) nValue = 121 
}else 

nValue = 121 

} 

xValuetj] = 

ResidualLevels3 [ (ResidualPack5In8 [nValue] >> (nCount*2) ) &3] 
} 

nPackedSamples [nCh] [nSubband] = nValue 
}else{ 

for (j = 0; j < 8; j++) { 

nValue = ExtractBits ( 1 ) 
if (nValue = 0) 
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syntax 



xValue [ j ] =0 
else 

xValue [j] = ResidualQuantizednLevell6 [nValue] 

} } } 
}else if (nLevel = 3) { 
if (nBitsLeft >= 7) { 

nValue = nPackedSamples [nCh] [nSubband] 
if (nValue > 124) nValue = 124 
for (j = 0; j < 8; j+ + ) { 

nCount = mod (3, [nSampleOf f set - ModuloOrigin + j]) 
if (nCount=0) { 

if (nBitsLeft >= 24) 

nValue = ExtractBits ( 7 ) 
}else if (nBitsLeft >= 7) { 
nValue = ExtractBits ( 7 ) 
if (nValue > 124) nValue = G2 
}else{ 

nValue = G2 

} } 

xValue [j] = ResidualLevels5 [ResidualPack3In7 [nValue] [nCount] ] 

} 

nPackedSamples [nCh] [nSubband] = nValue 

} } 
else { 

switch (nLevel) { 
default : 
case : 

break 
case 1 : 

if (nBitsLeft >= 8) { 

nValue = ExtractBits ( 8 ) 

xValue [0] = ResidualQuantizednLevellO [nValue & 1] 
nValue >>= 1 

xValue [1] = ResidualQuantizednLevellO [nValue & 1] 
nValue >>= 1 

xValue [2] = ResidualQuantizednLevellO [nValue & 1] 
nValue >>= 1 

xValue [3] = ResidualQuantizednLevellO [nValue & 1] 
nValue >>= 1 

xValue [4] = ResidualQuantizednLevellO [nValue & 1] 
nValue >>= 1 

xValue [5] = ResidualQuantizednLevellO [nValue & 1] 
nValue >>= 1 

xValue [6] = ResidualQuantizednLevellO [nValue & 1] 
nValue >>= 1 

xValue [7] = ResidualQuantizednLevellO [nValue & 1] 

} 

break 
case 4 : 

for (j=0; j < 8; j++) { 
if (nBitsLeft >= 24) 

xValue [j] = ResidualLevels8 [nCodeVal [nValue & 63]] 

} 

break 
case 5 : 

for (j=0; j < 8; j++) { 
if (nBitsLeft >= 4) 

xValue [j] = ResidualLevelsl6 [ExtractBits (4) ] 

} 

break 
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where: 

The ResidualLevels and ResidualQuantizednLevel tables are in clause 9.8.5. 

mod(x,y) is the modulo of y to the rank of x, so mod(x,y) is the integer remainder of y/x 



ResidualPack5In8 [256] = 
{ 



0x0000 , 


0x0100 , 


0x0200 , 


0x0040 , 


0x0140 , 


0x0240 , 


0x0080 , 


0x0180 , 


0x0280 , 


0x0010 , 


0x0110 , 


0x0210 , 


0x0050 , 


0x0150 , 


0x0250 , 


0x0090 , 


0x0190 , 


0x0290, 


0x0020, 


0x0120, 


0x0220, 


0x0060 , 


0x0160 , 


0x0260 , 


OxOOaO , 


OxOlaO , 


0x02a0 , 


0x0004 , 


0x0104 , 


0x0204, 


0x0044 , 


0x0144 , 


0x0244 , 


0x0084 , 


0x0184 , 


0x0284 , 


0x0014 , 


0x0114 , 


0x02 14 , 


0x0054 , 


0x0154 , 


0x0254 , 


0x0094 , 


0x0194 , 


0x02 94 , 


0x0024 , 


0x0124 , 


0x0224 , 


0x0064 , 


0x0164 , 


0x0264 , 


OxOOa4 , 


Ox01a4 , 


0x02a4 , 


0x0008 , 


0x0108 , 


0x0208, 


0x0048 , 


0x0148 , 


0x0248 , 


0x0088 , 


0x0188 , 


0x0288 , 


0x0018 , 


0x0118 , 


0x0218 , 


0x0058 , 


0x0158 , 


0x0258 , 


0x0098 , 


0x0198 , 


0x0298 , 


0x0028, 


0x0128, 


0x0228, 


0x0068, 


0x0168, 


0x0268, 


0x00a8, 


0x01a8, 


0x02a8, 


0x0001, 


0x0101, 


0x0201, 


0x0041, 


0x0141, 


0x0241, 


0x0081, 


0x0181, 


0x0281, 


0x0011, 


0x0111, 


0x0211, 


0x0051, 


0x0151, 


0x0251, 


0x0091, 


0x0191, 


0x0291, 


0x0021, 


0x0121, 


0x0221, 


0x0061, 


0x0161, 


0x0261, 


OxOOal, 


OxOlal, 


0x02al , 


0x0005, 


0x0105, 


0x0205, 


0x0045, 


0x0145, 


0x0245, 


0x0085, 


0x0185, 


0x0285, 


0x0015, 


0x0115, 


0x0215, 


0x0055, 


0x0155, 


0x0255, 


0x0095, 


0x0195, 


0x0295, 


0x0025, 


0x0125, 


0x0225, 


0x0065, 


0x0165, 


0x0265, 


0x00a5, 


Ox01a5, 


Ox02a5, 


0x0009, 


0x0109, 


0x0209, 


0x0049, 


0x0149, 


0x0249, 


0x0089, 


0x0189, 


0x0289, 


0x0019, 


0x0119, 


0x0219, 


0x0059, 


0x0159, 


0x0259, 


0x0099, 


0x0199, 


0x0299, 


0x0029, 


0x0129, 


0x0229, 


0x0069, 


0x0169, 


0x0269, 


0x00a9, 


0x01a9, 


0x02a9, 


0x0002 , 


0x0102 , 


0x0202 , 


0x0042 , 


0x0142 , 


0x0242 , 


0x0082 , 


0x0182 , 


0x0282 , 


0x0012 , 


0x0112 , 


0x0212 , 


0x0052 , 


0x0152 , 


0x0252 , 


0x0092 , 


0x0192 , 


0x0292 , 


0x0022 , 


0x0122 , 


0x0222 , 


0x0062 , 


0x0162 , 


0x0262 , 


0x00a2 , 


Ox01a2, 


0x02a2 , 


0x0006, 


0x0106, 


0x0206, 


0x0046, 


0x0146, 


0x0246, 


0x0086, 


0x0186, 


0x0286, 


0x0016, 


0x0116, 


0x0216, 


0x0056, 


0x0156, 


0x0256, 


0x0096, 


0x0196, 


0x0296, 


0x0026, 


0x0126, 


0x0226, 


0x0066, 


0x0166, 


0x0266, 


0x00a6, 


0x01a6, 


0x02a6, 


0x000a, 


0x010a, 


0x020a, 


0x004a, 


0x014a, 


0x024a, 


0x008a, 


0x018a, 


0x028a, 


0x001a, 


0x011a, 


0x021a, 


0x005a, 


0x015a, 


0x025a, 


0x009a, 


0x019a, 


0x029a, 


0x002a, 


0x012a, 


0x022a, 


0x006a, 


0x016a, 


0x026a, 


OxOOaa, 


OxOlaa, 


0x02aa, 


0x0003 , 


0x0103 , 


0x0203 , 


0x0043 , 


0x0143 , 


0x0243 , 


0x0083 , 


0x0183 , 


0x0283, 


0x0013 , 


0x0113 , 


0x0213 , 


0x0053 



















} 



ResidualPack3In7 [128] [3] = 



{ 





0, 


} , 


{ o. 


0, 


1 } , 


{ o. 


0, 


2 } / 


{ o. 


0, 


3 } . 


{ o. 


0, 


4 } / 


{ o. 


1, 


} , 


{ o. 


1, 


1 


. o. 


1, 


2 } , 


{ o. 


1, 


3 } . 


{ o. 


1, 


4 } / 


{ o. 


2, 


} , 


{ o. 


2, 


1 } , 


{ o. 


2, 


2 } / 


{ o. 


2, 


3 


0, 


2, 


4 } , 


{ o. 


3, 


} , 


{ o. 


3, 


1 } , 


{ o. 


3, 


2 } / 


{ o. 


3, 


3 } . 


{ o. 


3, 


4 } / 


{ o. 


4, 





o. 


4, 


1 } , 


{ o. 


4, 


2 } / 


{ °. 


4, 


3 } . 


{ o. 


4, 


4 } / 


{ !> 


0, 


} , 


{ !. 


0, 


1 } , 


{ !. 


0, 


2 


' 1, 


0, 


3 } - 


{ 1 i 


0, 


4 } , 


{ !. 


1, 


} , 


{ !> 


1, 


1 } , 


{ !. 


1, 


2 } / 


{ !< 


1, 


3 } - 


{ !> 


1, 


4 


' 1, 


2, 


} , 


{ !> 


2, 


1 } , 


{ !. 


2, 


2 } / 


{ !. 


2, 


3 } - 


{ !. 


2, 


4 } / 


{ !> 


3, 


} , 


{ !. 


3, 


1 


' 1, 


3, 


2 } , 


{ !> 


3, 


3 } - 


{ !- 


3, 


4 } / 


{ !. 


4, 


} , 


{ !< 


4, 


1 } , 


{ !> 


4, 


2 } / 


{ !< 


4, 


3 


1, 


4, 


4 } , 


{ 2 , 


0, 


} , 


{ 2, 


0, 


1 } , 


{ 2, 


0, 


2 } / 


{ 2, 


0, 


3 } . 


{ 2, 


0, 


4 } / 


{ 2, 


1, 





. 2, 


1, 


1 } , 


{ 2 , 


1, 


2 } / 


{ 2, 


1, 


3 } . 


{ 2, 


1, 


4 } / 


{ 2, 


2, 


} , 


{ 2, 


2, 


1 } , 


{ 2, 


2, 


2 


. 2, 


2, 


3 } . 


{ 2, 


2, 


4 } , 


{ 2, 


3, 


} , 


{ 2, 


3, 


1 } , 


{ 2, 


3, 


2 } / 


{ 2, 


3, 


3 } . 


{ 2, 


3, 


4 


2, 


4, 


} , 


{ 2, 


4, 


1 } , 


{ 2, 


4, 


2 } / 


{ 2, 


4, 


3 } . 


{ 2, 


4, 


4 } / 


{ 3 - 


0, 


} , 


{ 3 - 


0, 


1 


3, 


0, 


2 } , 


{ 3 - 


0, 


3 } - 


{ 3 - 


0, 


4 } / 


{ 3 - 


1, 


} , 


{ 3 - 


1, 


1 } , 


{ 3 - 


1, 


2 } / 


{ 3 - 


1, 


3 


; 3, 


1, 


4 } , 


{ 3 ' 


2, 


} , 


{ 3 - 


2, 


1 } , 


{ 3 - 


2, 


2 } / 


{ 3 - 


2, 


3 } . 


{ 3 - 


2, 


4 } / 


{ 3 ' 


3, 





; 3, 


3, 


1 } , 


{ 3 - 


3, 


2 } / 


{ 3 - 


3, 


3 } - 


{ 3 - 


3, 


4 } / 


{ 3 - 


4, 


} , 


{ 3 - 


4, 


1 } , 


{ 3 - 


4, 


2 


; 3, 


4, 


3 } . 


{ 3 - 


4, 


4 } , 


{ 4 , 


0, 


} , 


{ 4, 


0, 


1 } , 


{ 4, 


0, 


2 } / 


{ 4, 


0, 


3 } . 


{ 4, 


0, 


4 


4, 


1, 


} , 


{ 4 , 


1, 


1 } , 


{ 4, 


1, 


2 } / 


{ 4, 


1, 


3 } . 


{ 4, 


1, 


4 } / 


{ 4, 


2, 


} , 


{ 4, 


2, 


1 


' 4, 


2, 


2 } , 


{ 4 , 


2, 


3 } . 


{ 4, 


2, 


4 } / 


{ 4, 


3, 


} , 


{ 4, 


3, 


1 } , 


{ 4, 


3, 


2 } / 


{ 4, 


3, 


3 


' 4, 


3, 


4 } , 


{ 4 , 


4, 


} , 


{ 4, 


4, 


1 } , 


{ 4, 


4, 


2 } / 


{ 4, 


4, 


3 } . 


{ 4, 


4, 


4 } / 


{ 5 - 


0, 





; 5, 


0, 


1 } , 


{ 5 - 


0, 


2 } 

































} 



nCodeVal [64] = 
{ 

1, 2, 4, 3, 5, 2, 0, 3, 1, 2, 4, 3, 6, 2, 0, 3, 1, 2, 4, 3, 5, 2, 0, 3, 1, 2, 4, 3, 7, 2, 0, 3, 
1, 2, 4, 3, 5, 2, 0, 3, 1, 2, 4, 3, 6, 2, 0, 3, 1, 2, 4, 3, 5, 2, 0, 3, 1, 2, 4, 3, 4, 2, 0, 3 

} 

9.5.2.2.1 Quantization Profiles 

The amount of bits available to store (quantized) information on individual values of subband samples varies from 
frame to frame. The actual amount depends on many variables: bitrate, number of bits occupied by tonal information, 
number of bits occupied by residual scalefactor grids. In these conditions it is feasible to use individual quantization 
profile in each frame. For multichannel files, a quantization profile can be selected individually for each channel pair. 
The quantization profile is characterized by 8 bit value with the following fields: 

• 3 bits - overall level; 

• 2 bits - steepness; 

• 3 bits - subband with maximum energy out of the first 8 subbands. 
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Quantization level for a given subband can be calculated with the following formulas: 



SNR 



18000 



12 X nSubband x SampleRate 



+ 100 + 40x5 



+ 20xL 



1000 xTotalSubbands 
where: 

SNR - desired signal-to-noise ratio for given subband, in dB x 10 
SampleRate - sampling rate in Hz; 
ST - steepness; 
OL - overall level. 

A quantizer is then selected that give SNR value closest to calculated by the equation above according to: 

Table 9-33: Quantizer levels 



SNR, dB x 10 


Number of levels in quantizer 


<95 


2 


> 95 and < 140 


3 


> 140 and < 180 


5 


> 180 and < 230 


8 


>230 


16 



9.5.2.2.2 



Scale Factor Processing 



At the encoder, the initial scale factor grid is referred to as the high resolution grid and is constructed by selecting the 
larger amplitude out of each pair of successive samples. This results in 32 bands by 64 scale factors per primary 
channel, but this grid is not transmitted due to its large size. Instead, 3 separate lower resolution grids are derived from 
it and encoded for packing into the bit stream. 

At the decoder, the low resolution grids are decoded, re-assembled and then used to reconstruct a high resolution scale 
factor grid for sample scaling. If the 'high resolution grids' chunk is present, it is possible to construct scalefactors with 
better resolution. This chunk contains the 1 st subband from Gridl which should be processed first. Both Grid2 and 
Grid3 information is used to correct the results calculated using only Gridl information. That is, Grid2 and Grid3 
contain difference information for the Gridl information. 

Below is a summary of the frequency and time resolution of each grid at 44,1/48 KHz, (refer to the appropriate tables to 
derive 96 KHz values.) 

Grid 1 has 10 bands of 8 scale factors, (see clause 9.8.4, Gridl mapping table, for details). It should be noted that a skip 
and interpolate technique is also used at the encoder to improve coding efficiency, so the actual number of Gridl factors 
in the stream could be less than 10 by 8: 

Bands to 3 correspond to frequency subband to 3 respectively 

Band 4: frequency subbands 4, 5 

Band 5: frequency subbands 6 to 9 

Band 6: frequency subbands 7 to 12 

Band 7: frequency subbands 10 to 17 

Band 8: frequency subbands 14 to 23 

Band 9: frequency subbands 19 to 29 
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Grid 2 has 3 bands of 64 scale factors: 

• Band 1: frequency subbands 4 to 9 

• Band 2: frequency subbands 10 to 17 

• Band 3: frequency subbands 18 to 31 
Grid 3 has 26 bands of 8 scale factors: 

• Mapping to frequency subbands 4 to 29 

9.5.2.2.3 Decoding of Grid 1 scale factors 

Gridl has overlapping subbands (as opposed to Grid2 and Grid3). The bitstream contains amplitudes of selected Gridl 
scalefactors and the distances between them. Any factors missing from the bit stream should be linearly interpolated 
from these values to produce 8 scalefactors per subband. 

These values should further be mapped to the high-resolution grid using the weights from the tables in clause 9.8.4. The 
Gridl chunk in the bit stream has information about all the Gridl subbands except the first and second subband because 
for coding at low bitrates where there are no high resolution grids and no TimeSamples chunks in the bitstream, this 
subband is always zero. 

9.5.2.2.4 Decoding of Grid 2 scale factors 

Grid2 scalefactors are stored in groups of 8. There is a lbit flag stored before each group - if this flag is - the whole 
group has values. Otherwise individual values are stored. These values should be subtracted from the high-resolution 
grid. Grid2 has the same time resolution as the high-resolution grid, but there are only three subbands. These three 
subbands are mapped without weighting to high-resolution bands 4 to 7; 8 to 15; and 16 to 31 respectively. 

9.5.2.2.5 Decoding of Grid 3 scale factors 

Grid3 has the same frequency resolution as the high-resolution grid but has low time-resolution - there are only 8 time 
intervals. Since Gridl already has the highest possible frequency resolution for the lowest 4 subbands, there is no data 
for these subbands in Grid3. Due to this, Grid3 contains only 28 subbands which should be mapped to subbands 4 to 31 
of the high resolution grid. 

9.5.2.3 Reconstruction of Hi resolution scale factors grid 

High-resolution scalefactors are constructed from Gridl and 'high resolution grids' chunks. In this step all the grids 
obtained from the bitstream are mapped onto a grid of scalefactors which has enough time and frequency resolution to 
include information from all the grids. A grid 64 time intervals is sufficient. In the clauses that follow, this grid will be 
referred as the 'high-resolution grid'. 

9.5.2.4 LPC synthesis 

The synthesis process contains the following steps (for each subband): 

1) Getting reflection coefficients (LpcQtd[] in Table 9-31) from bitstream. 

2) Dequantizing coefficients to direct linear representation. 

3) Running predictor over the samples in a form of IIR filter. 

For each primary channel, prediction is applied to the first two subbands only and each frame is divided into two blocks 
for processing. So there would be two groups of 8 coefficients per subband in the bit stream and each set should be used 
to synthesize samples for half of the frame. 
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9.5.2.5 Timesamples Processing 

There are several methods for packing time sample information into the bitstream. These methods are indicated with 
flags that apply to a single subband for the entire residual frame. 

• If nLRMS (Mid/Side)is set, the time samples are used for both channels. 

• If nSubbandMidSide is set in conjunction with nLRMS , the time samples for the right channel should be 
inverted. 

• The TSCodingMethod flag determines whether time samples are packed using Huffman codes or packed 
directly as 5 values in 8 bits for 3-level quantization or as 3 values in 7 bits for 5-level quantization. This flag 
is effective only for 3- and 5-level quantization. 9- and 8-level quantization always uses Huffman codes. 2- 
level quantization always uses direct packing (lbit/sample). 

De-quantized values are described in the tables found in clause 9.8.5. 

The quantization levels allocation scheme used to quantize the time-samples depends on quantization profile that is 
selected individually for each frame and each channel pair. A procedure of assignment of quantization levels to 
subbands is described in clause 9.5.2.2.1 

When no information about specific sample value is found in the bitstream, a pseudo-random value with the range 
[-1...1] is substituted. This process is referred to as white noise substitution. 

After de-quantization, the time samples are scaled with the high-resolution scale factors (see clause 9.5.2.2.2). There are 
2 samples for each scale factor in the high-resolution scale factors grid. Each time sample should be multiplied by the 
linear scale factor value obtained from the Grid 1, 2 and 3 scale factor extraction. 



9.5.3 Inverse Filterbank 

The inverse filterbank is constructed of evenly spaced critically downsampled subbands. The filterbank is a hybrid 
structure performed with the following steps: 

1) Windowing input data in each subband 

8-point forward MDCT 

Grouping 8-point MDCT results into a single set of MDCT coefficients 

Aliasing cancellation for high frequencies, described by the following pseudo-code: 

ALl = 0.30865828381746 

AL2 = 0.03806023374436 

a = mdct_band[i] [3] * ALl 

b = mdct_band [i+1] [0] * ALl 

mdct_band [i] [3] = mdct_band [i] [3] + b-a 
mdct_band [i + 1] [0] = mdct_band [i + 1] [0] + b+a 
a = mdct_band [i] [2] * AL2 
b = mdct_band [i+1] [1] * AL2 

mdct_band [i] [2] = mdct_band [i] [2] + b-a 
mdct_band [i + 1] [1] = mdct_band [i+1] [1] + b+a 

Where mdct band [i] is i th band of MDCT coefficients, and the short window filter is shown in Table 9-34. 



Table 9-34: Short window filter 



0,02281089288256 
0,41799773023326 
0,90844807089885 
0,99973979773034 
0,99973979773034 
0,90844807089885 
0,41799773023326 
0,02281089288256 



2) N-point inverse MDCT. N depends on sample-rate as shown in Table 9-19 
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3) Window (Long Window in clause 9.8.6) 

4) Overlap-add. 

This inverse filterbank structure allows an optimized implementation to use the same filterbank for both residual coding 
and tonal components reconstruction, which speeds-up the decoding process considerably. 

9.5.4 LFE Chunk 

The LFE Chunk is used to store ADPCM-encoded LFE channel samples. 



Table 9-35: LFE Chunk 



Syntax 


Size (Bits) 


LFEChunk ( ) 




{ 

if (chunkID != LFE_ADPCM_ID) 


8 


return 




ChunkLengthlnf o { ) 


Table 9-9 


DecodeLFE { ) 




ByteAlign { ) 

} 


0..7 



Table 9-36: Decode LFE 



Syntax 


Size (Bits) 


DecodeLFE ( ) { 




InitLFE ( ) 


Table 9-37 


vp = 




nsamples = 4 




to Ltr^J J. 11 Lit: A. — U 




if (nSampleRate<14000) { 




upsampleFactor = 1G 




}else if (nSampleRate<28000) { 




upsampleFactor = 32 




}else if (nSampleRate<50000) { 




upsampleFactor = 64 




}else{ 




upsampleFactor = 12 8 

} 




nScale = upsampleFactor * 0x7fff 




vp = predictedSamplelnit 




stepindex = stepSizelndexInit 




if (bLFEinput24Bit) { 




stepsize = lfe StepSizeTable24 [stepindex] 




for (nsample = 0; nsample<nsamples ; nsample++) { 




code = ExtractBits ( S ) 


6 


// Calculate predicted delta as ( (code+0 . 5) *step) /16 




pdelta = stepsize/32 




if (code & 16) 




pdelta += stepsize 




if (code & 8) 




pdelta += stepsize/2 




if (code & 4) 




pdelta += stepsize/4 




if (code & 2) 




pdelta += stepsize/8 




if (code & 1) 




pdelta += stepsize/16 




// Update predicted value 




if (code & 32) { 




vp -= pdelta 




if (vp < -1.3f) 




vp = -1.3f 




}else{ 
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Syntax 


Size (Bits) 


vp += pdelta 
if (vp > 1.3f) 
vp = 1.3f 

} 

// Adjust step size 

stepindex += If e_DeltaIndex24 [code & 31] 
if (stepindex < 0) { 

stepindex = 
}else if (stepindex > 143) { 

stepindex = 143 

} 

stepsize = lfe StepSizeTable24 [stepindex] 
// Output data 

LFEdata [nsample] = vp * nScale 

} } 

else 

{ 

stepsize = lfe StepSizeTablel6 [stepindex] 
for (nsample = 0; nsample<nsamples ; nsample++) { 
code = ExtractBits (4) 

// Calculate predicted delta as ( ( code+0 . 5 ) *step) /4 
pdelta = stepsize/8 
if (code & 4) 

pdelta += stepsize 
if (code & 2) 

pdelta += stepsize/2 
if (code & 1) 

pdelta += stepsize/4 
// Update predicted value 
if (code & 8) { 

vp -= pdelta 

if (vp < -1.3f) 
vp = -1.3f 

}else{ 

vp += pdelta 
if (vp > 1.3f) 
vp = 1.3f 

} 

// Adjust step size 

stepindex += lfe Deltalndexl6 [code & 7] 
if (stepindex < 0) { 

stepindex = 
Jelse if (stepindex > 100) { 

stepindex = 100 

} 

stepsize = lfe StepSizeTablel6 [stepindex] 
// Output data 

LFEdata [nsample] = vp * nScale 

} } 


4 



where: 

lfe_DeltaIndexl6 [] and If e_DeltaIndex24 [] are in clause 9.8.7, 
lfe_StepSizeTablel6 [] and If e_StepSizeTable 24 [] are in clause 9.8.8 



Table 9-37: Init LFE Decoding 



Syntax 


Size (Bits) 


InitLFE ( ) { 




// determining input bit depth from chunk size 




bLFEinput24Bit = (bLBRCompressedFlags && LBR_FLAG_24_BIT_SAMPLES) 




if (bLFEinput24Bit) { 




int ps = ExtractBits (8) 


8 


int_ps |= ExtractBits (16) << 8 


16 
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Syntax 

ps_mult = 0x007fffff 

if (int_ps & 0x00800000) { 

int_ps &= 0x007fffff 

ps_mult = -ps_mult 

} 

}else{ 

int_ps = ExtractBits ( 16 ) 

ps_mult = 0x007fff 

if (int_ps & 0x008000) { 

int_ps &= 0x007fff 

ps_mult = -ps_mult 

} 

} 

predictedSamplelnit = int_ps/ps_mult 
stepSizelndexInit = ExtractBits ( 8 ) 



Size (Bits) 



16 



The number of bits used for Deitavalue and startstepsizeindex depends on the source data resolution: 3 bits for 16- 
bit source, 5 bits for 24-bit source. 

9.5.4.1 LFE decoding 

LFE decoding is performed whenever LFE channel information is present in the bitstream. ADPCM -encoded LFE 
coefficients are extracted from the bitstream and ADPCM synthesis is performed. A further upsampling is required to 
match the LFE channel sample rate to the rest of the channels: 

• 64 times for sample rates < 48 kHz; 

• 128 times for sample rates > 48 kHz and < 96 kHz; and 

• 256 times for sample rates > 96 kHz. 
ADPCM synthesis process: 

ADPCM samples are stored to the bitstream as fixed-length numbers. 4 bits are used for 16-bit source samples and 
6 bits are used for 24-bit samples. A simple ADPCM method, linearly predicting DPCM step size from the difference 
between previous two samples is implemented. 

The current sample value V is calculated by one of the following formula: 
v = Vp + {-\)Sign x {C ° de + °' 5) X Ste P Size for 16-bit samples 

V=v P + (- 1) Sign x {C ° de + °' 5) X Ste P Size for 24-bit samples 

where: 

Code is the part of encoded sample which shows the absolute ratio of the difference between the current and the 
previous samples to StepSize. 

Sign is the sign of the difference between the current and the previous samples. 

Vpis the value of the previous decoded sample. 

StepSize is the step size 

StepSizelndexis the index of the StepSize'm the Step Size Table. 
At the end of the step, new Vp and StepSize values are calculated: 
Vp'= V. 

StepSizelndex '= StepSize/ndex+ DeltalndexTablefCode]. 
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StepSize '= StepSizeTable[StepSizeIndex'] . 
See the appropriate tables in clause 9.8.7 for DeltalndexTable values and in clauses 9.8.8 for StepSize table values. 
StepSizelndex and Vp are read at the start of each frame to ensure frame-drop tolerance. 
Initial StepSize\s StepSizeTable[0]. 

9.5.5 Embedded Channel Sets Chunk 

The Embedded Channel Sets (ECS) chunk holds residual samples inter-channel replacement information in the case of 
stereo downmix (LBR_FLAG_STEREO_DOWNMIX is set) and downmix scaling and contribution coefficients in the 
case of multichannel downmix (LBR_FLAG_MULTICHANNEL_DOWNMIX is set). 

9.5.5.1 Embedded channel sets 

Whenever the embedded channel sets chunk is present in the bitstream, additional operations are needed to extract the 
original channels. The required information is obtained directly from the embedded channel sets chunk. 

There are two possible scenarios: 

• Decoders capable of decoding no more than 2 channels: will decode the first two channels, ignoring irrelevant 
information in the tonal chunk and irrelevant residual chunks. 

• Decoders capable of decoding 5.1 channels: will decode 5 channels plus the LFE channel. 



9.5.5.2 



Stereo downmix case 



Whenever a non-zero number is present for replacementChannel in ECSChunk(), a corresponding number of 
timesamples have to be taken from that replacementChannel (which is one of the downmixed channels). For each 
channel and for each subband there is a single entry in ECSChunk which covers a quarter of a frame e.g. four 
replacementChannel values per frame duration. 

Table 9-38: Embedded Channel Set Chunk 



Syntax 



Size (Bits) 



ECSChunk ( ) 



if (chunklD != EmbLevels_ID) 

return 
ChunkLengthlnf o ( ) 
if (LBR_FLAG_USE_LFE) 

nFullbandChannels = nChannels + 1 

else 

nFullbandChannels = nChannels 
// there are two types of information in this chunk: 
// - replacement information for stereo downmix, or 
// - scaling/contribution info for multichannel downmix 
if (replacementPair) { 

stSB = ExtractBits (7) 

enSB = ExtractBits (7) 

for (c = 0; c < nChannels * nTotalSubbands * 4; ++c) { 
nValue = ExtractBits ( 1 ) 
if (nValue) { 

canReplace = c 

nValue = ExtractBits ( 1 ) 

if (nValue) { 

replaceChannel = c 

} } } } 



8 

Table 9-9 



else 



i f { LBR_FLAG_MULTI CHANNELJDOWNMIX ) 
bMultichannelDownMix = TRUE 

else 

bMultichannelDownMix = FALSE 
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Syntax 



Size (Bits) 



if <LBR_FLAG_CS_IS_LAST_DOWNMIX) 
nCsIsLast = 1 

else 

nCsIsLast = 

nContChAenc = ExtractBits (4 ) 
nContChBenc = ExtractBits (4 ) 

if (nChannels >= 2 + 6) 

nContChA = ContChOrderForDecode [nFullbandChannels-2-6] [nChannels - 
nFullbandChannels] [nContChAenc] 

if (nContChBenc >= (nFullbandChannels + (bMultichannelDownMix) ? : 2 ) ) { 

nContChB = -1 

nContChBenc = -1 
}else{ 

// there is no need to compute indexes of contributed channels for decoder 
// if we're only decoding 5.x (or 4.x) embedded downmix 
if (nFullbandChannels >= 2+6) 

nContChB = ContChOrderForDecode [nFullbandChannels-2-6] [nChannels - 
nFullbandChannels] [nContChBenc] 
} 

// Read 4 contribution coeffs 
for (c = 0; c < 4- (2*nCsIsLast) ; C++) { 
nDMixCoeff Index = ExtractBits ( 6 ) 

DMixContributedCoef f s [c] = DMixContribution_IndexTodB [nDMixCoeff Index] 

} 

// Read 1 or all down-mix channels scaling coeffs 
nValue = ExtractBits ( 1 ) 
if (nValue) { 

for (c=0; c < nChannels - ( (bMultichannelDownMix) ?4 : ) ; C++) { 
nDMixCoeff Index = ExtractBits (5) 

DMixScalingCoef f s [c] = DMixContribution_IndexTodB [nDMixCoeff Index] 
DMixScalingCoef f s_dB [c] = DMixScalingCoef fs [c] 



if( (DMixScalingCoef fs [c] = -200. Of) 
DMixScalingCoef fs [c] = l.Of 
DMixScalingCoef fs_K [c] = l.Of 

}else{ 

DMixScalingCoef fs [c] = pow(10.0f 



| (DMixScalingCoef fs [c] = O.Of) 



-DMixScalingCoef fs [c] / 20. Of) 



DMixScalingCoef fs_K [c] = DMixScalingCoef fs [c] 



else 



nDMixCoeff Index = ExtractBits ( 5 ) 

DMixScalingCoef fs [0] = DMixScaling_IndexTodB [nDMixCoeff Index] 
for (c=0; c < nChannels - ( (bMultichannelDownMix) ?4 : ) ; C++) 

DMixScalingCoef fs_dB [c] = DMixScalingCoef fs [0] 
if( (DMixScalingCoef fs [0] = -200. Of) | (DMixScalingCoef fs [0] : 

DMixScalingCoef fs [0] = l.Of 

DMixScalingCoef fs_K [0] = DMixScalingCoef fs [0] 



O.Of) ) { 



else { 

DMixScalingCoef fs [0] =pow(10.0f, -DMixScalingCoef fs [0] / 20. Of) 
DMixScalingCoef fs_K [0] = DMixScalingCoef fs [0] 

} 

for (c=l; c < nChannels - ( (bMultichannelDownMix) ?4 : ) ; C++) { 
DMixScalingCoef fs [c] = DMixScalingCoef fs [0] 
DMixScalingCoef fs_K [c] = DMixScalingCoef fs [0] 



// LFE channel 

if (nFullbandChannels 



nChannels ) 



lfecoeff = DMixScalingCoef fs [nChannels - ( (bMultichannelDownMix) ?4 : )- 1] 
lfecoeffK = DMixScalingCoef fs_K [nChannels - ( (bMultichannelDownMix) ?4 : )- 1] 
lfecoef f_dB = DMixScalingCoef f sdB [nChannels - ( (bMultichannelDownMix) ?4 :0) -1] 
for (c = nChannels - ( (bMultichannelDownMix) ?4 : )- 1 ; c > 3; --c){ 

DMixScalingCoef fs [c] = DMixScalingCoef fs [c-1] 

DMixScalingCoef fs_K [c] = DMixScalingCoef fs_K [c-1] 
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Syntax 


Size (Bits) 


DMixScalingCoef f s dB [c] = DMixScalingCoef f s dB[c-l] 

} 

DMixScalingCoef fs [3] = lfecoef f 
DMixScalingCoef fs K[3] = lfecoeff K 
DMixScalingCoef fs dB[3] = lfecoeff dB 

} } 

ByteAlign ( ) 

} 


0..7 



where: 

DMixScaiing_indexTodB [] and DMixContribution_indexTodB [] are presented in clause 9.8.9 

ContChOrderForDecode [2] [2] [6] = 
{ 

// Number of Dmix channels == 6 
{ 

//no LFE 

{0, 1, 2, 3, 4, 5}, 

// LFE exists 

{0, 1, 2, 4, 3, 5} 

}, 

// Number of Dmix channels == 7 
{ 

// no LFE 

{0, 1, 4, 2, 3, 5}, 

// LFE exists 

{0, 1, 4, 5, 2, 3} 

} } 



9.6 Program Associated Data Chunk 

The Program Associated Data Chunk can be used to store any data that is associated with the audio frame but is not 
required for decoding of the audio frame. 



Table 9-39: Pad Chunk 



Syntax 


Size (Bits) 


padChunk ( ) 
{ 

chunkID == padID 
ChunkLengthlnf o ( ) 
ProgramAssociatedData 
ByteAlign { ) 

} 


8 

Table 9-9 
variable 
0..7 



9.7 Null Chunk 

The Null Chunk is used to pad out the parent chunk. It is typically used when the parent chunk needs to be a constant 
size and the child chunks vary in size. The Null Chunk is different from other chunks in that the chunkLength field 
is optional if the Null Chunk is the last chunk within the parent chunk. It is recommended that the data field be filled 
with zero bytes. 



Table 9-40: Null Chunk 


Syntax 


Size (Bits) 


nullChunk ( ) 






{ 

NullID 

[chunkLength] 
NullData 

} 




8 

16 
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9.8 



Tables 



9.8.1 Quantized Amplitude to Linear Amplitude Conversion 

Table 9-41 : Quantized Amplitude to Linear Amplitude Conversion 



Quantized 


Linear 





0,17678 


1 


0,42678 


2 


0,60355 


3 


0,85355 


4 


1,20711 


5 


1,68359 


6 


2,375 


7 


3,36719 


8 


4,75 


9 


L 6,73438 


10 


9,5 


11 


13,46875 


12 


19 


13 


26,9375 


14 


38 


15 


53,875 



Quantized 


Linear 


16 


76 


17 


107,75 


18 


152 


19 


215,5 


20 


304 


21 


431 


22 


608 


23 


862 


24 


1216 


25 


1724 


26 


2432 


27 


3448 


28 


4864 


29 


6896 


30 


9728 


31 


13792 



Quantized 


Linear 


32 


19456 


33 


27584 


34 


38912 


35 


55168 


36 


77824 


37 


110336 


38 


1 55648 


39 


220672 


40 


311296 


41 


441 344 


33 


27584 


42 


622592 


43 


882688 


44 


1245184 


45 


1765376 


46 


2490368 
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9.8.2 Wave synthesis envelope table 

Table 9-42: Wave synthesis envelope table 



index 


wavSynEnvelope 





0,00240763666390 


1 


0,00960735979838 


2 


0,02152983213390 


3 


0,03806023374436 


4 


0,05903936782582 


5 


0,08426519384873 


6 


0,11349477331863 


7 


0,14644660940673 


8 


0,18280335791818 


9 


0,22221488349020 


10 


0,26430163158700 


11 


0,30865828381746 


12 


0,35485766137277 


13 


0,40245483899194 


14 


0,45099142983522 


15 


0,50000000000000 


16 


0,54900857016478 


17 


0,59754516100806 


18 


0,64514233862723 


19 


0,69134171618254 


20 


0,73569836841300 


21 


0,77778511650980 


22 


0,81719664208182 


23 


0,85355339059327 


24 


0,88650522668137 


25 


0,91573480615127 


26 


0,94096063217418 


27 


0,96193976625564 


28 


0,97847016786610 


29 


0,99039264020162 


30 


0,99759236333610 


31 


1 ,00000000000000 



index 


wavSynEnvelope 


32 


0,99759236333610 


33 


0,99039264020162 


34 


0,97847016786610 


35 


0,96193976625564 


36 


0,94096063217418 


37 


0,91573480615127 


38 


0,88650522668137 


39 


0,85355339059327 


40 


0,81719664208182 


41 


0,77778511650980 


42 


0,73569836841300 


43 


0,69134171618255 


44 


0,64514233862723 


45 


0,59754516100806 


46 


0,54900857016478 


47 


0,50000000000000 


48 


0,45099142983522 


49 


0,40245483899194 


50 


0,35485766137277 


51 


0,30865828381745 


52 


0,26430163158700 


53 


0,22221488349020 


54 


0,18280335791818 


55 


0,14644660940673 


56 


0,11349477331863 


57 


0,08426519384873 


58 


0,05903936782582 


59 


0,03806023374436 


60 


0,02152983213390 


61 


0,00960735979838 


62 


0,00240763666390 


63 
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9.8.3 Base function synthesis correction coefficients 



Table 9-43: Base function synthesis correction coefficients 





Spectral line offset 


F dlt 


.5 


-4 


.3 


-2 


.\ 


o 


1 


2 


3 


4 


5 


o 


-0,01 179 


0,04281 


0,4671 2 


46345 


-3 94525 


3 94525 


-0 46345 


-0,4671 2 


-0,04281 


0,01 179 


-0 00299 


1 


-0 00929 

W , W W Wfc. J 


0488? 


0,45252 


0,37972 


-8 85446 

W 3 UJ^U 


4 03189 


-0 55069 

W 3 wwww w 


-0,4804 


-0 03599 


0,01445 


-0 002P9 

W . W W <— c w 


2 


-0 00696 


05408 


0,43674 


?99K1 

W , £. W W W 1 


-3 75975 

W 3 / W J / w 


4,1 1413 


-0 64135 

W , W^ 1 WW 


-0,49221 


-0 02834 


0,01726 


-0 001 56 

W , W W 1 WW 


3 


-0 00481 


05847 

W i WWWt^ 1 


41998 

W j i 1 ^JJ 


0,2231 9 


-3 66138 

W , WW 1 wu 


4,19175 


-0 73529 

W , / W Jl W 


-0,50241 


-0 01 983 

W , W 1 www 


0,02021 


-0 0008 

W , WWWW 


4 


-0 00284 

W , W W £- W^ 


0621 6 

W , \J\JC- I W 


0,40224 


15053 

W , 1 WWW w 


-3 55963 

w , w w w w w 


4,26452 


-0 83239 

W 3 w w c w w 


-0 51085 

W 3 W 1 www 


-0,01 047 


02328 

W 3 UlJlU 


-0 00003 

W , WW www 


5 


-0 00105 

u,uu i w w 


0651 5 

W , WWW 1 W 


38378 

W 3 WWW / w 


08168 

W , WW 1 uu 


-3,45475 


4,33225 


-0 93249 

W 3 JOttJ 


-0 51738 

W 3 W 1 / WW 


-0 00024 

W , W W Utt 


02646 

W 3 utU^U 


00074 

W , WWW / ' 


6 


00054 

W j WWWWt^ 


06745 

W 3 WW / *TW 


0,36471 


01668 

W 3 W 1 www 


-3 34703 

W , W^ /WW 


4,39475 


-1 03543 

1 . WW Wt^W 


-0,52184 


01085 

W 3 W 1 www 


02973 

W , W£- W / W 


001 52 

W , WW 1 w c 


7 


n ooiq^ 

W , WW 1 


W , WWC7 1 c. 


U,JtJ 1 vj 


-0 0444^ 






-1 ,14105 


-fl <i241 


o n??ft 


w , WOOWW 


00??R 


8 


n 00^1 ft 

W , WWW I o 


0701 7 


n <: ?? c i?i 


-0 1 Olfifi 

W, 1 W 1 WO 


-3 12422 




-1 24914 


-fl ^24 


O^fil 




n oo^n? 

U,UUOUt 


9 


0,00422 


07065 

W j V/ / WWW 


30503 

w , w w w w w 


-0 1 5503 

W . 1 w WWW 


-3 00969 


4,54921 


-1 35952 

1 3 w w w w c 


-0,52141 


04925 


03981 

W , WWWW 1 


00373 

w , WWW / w 


10 


00508 


07061 

W j V/ / WW 1 


0,28471 


-0,2045 


-2 89348 


4 58919 


-1 ,47197 


-0 51618 

W 3 W 1 W 1 w 


0,0637 


0431 9 

W , W^W 1 w 


0,0044 


1 1 


00577 


07007 

W j V/ / WW / 


26436 


-0 2501 3 

W , C- W W 1 W 


-2,77587 


4,62321 


-1 ,58627 


-0 5081 8 

W 3 WWW 1 w 


07895 

W 3 W / www 


04652 


00501 

W . WWWW 1 


12 


00629 

W , W W \JC-U 


06909 


0,2441 


-0,29194 


-2,6571 6 


4,651 18 


-1 ,7021 9 


-0,49727 


09494 


04979 

W , W^ W / w 


00556 

W , WWWW w 


13 


00666 


06769 

W j WW /WW 


0,224 


-0,33 


-2,53764 


4,67302 


-1 ,81949 


-0 48335 


0,1 1 166 


05295 

W 3 WWt-W w 


00604 

W j WWWWt^ 


14 


00687 

w , w w w w / 


0659? 

W , WWWW t. 


0,2041 6 


-0 36435 

W , W Ut^JJ 


-2,41 76 


4 68866 

i . WW WWW 


-1 ,93791 


-0,46627 


0,12904 


05597 

w , WWWW / 


00642 

W , W WW"t- 


15 


fl 00fiQ4 


W 3 vvvO O 


U, 1 otuo 




-2,29732 


4 fiQROfi 


-? 0^7? 


-fl AAW* 


fl 147f) ( i 

U, It / UvJ 


W,WWOO 1 


n 00^71 

W,WWW/ I 


16 




D 06144 


n ifi^fii 


-0 42223 


-2 1 771 


4 7012 


-2 1 771 


-0 42223 


n ifi'ifii 

U, 1 UJU 1 


n 0fi144 

W , WW 1 ii 


W j wwwww 


17 


00671 

W , UUU / 1 


05881 

W . W WWW 1 


0,14705 


-0 44593 


-2,0572 


4 69806 

i . WW WWW 


-2,29732 


-0 39506 

W . W W w w w 


18468 

W 3 1 U^UU 


06383 

w , WWWW w 


00694 


18 


00642 

w , w w wm^£- 


05597 

W , UJJJ / 


0,12904 


-0,46627 


-1 ,93791 


4 68865 

i . WW www 


-2,41759 


-0 36435 


0,2041 6 


06592 

W , WWWW C- 


00687 

W , WWWW / 


19 


00604 


05295 


0,1 1 166 


-0 48334 


-1 ,81949 


4,67301 


-2 53763 

, WW /WW 


-0,33 


0,224 


06769 

W 3 WW /WW 


00666 

W , WWWW w 


20 


00556 

W , UUJJU 


04979 

W , W^ W / w 


09494 


-0,49727 


-1 ,7021 9 


4,651 17 


-2,6571 5 


-0,29194 


0,24409 


06909 


00629 


21 


00501 


04652 


07894 


-0 50818 


-1 ,58627 


4,62321 


-2,77587 


-0 25013 


26436 

vy 3 £ — w r w ^ 


07007 


00577 


22 


0,0044 


0,04319 


0,0637 


-0,51618 


-1,47197 


4,58919 


-2,89348 


-0,2045 


0,28471 


0,07061 


0,00508 


23 


0,00373 


0,03981 


0,04925 


-0,52141 


-1,35952 


4,54921 


-3,0097 


-0,15503 


0,30503 


0,07065 


0,00422 


24 


0,00302 


0,03643 


0,03561 


-0,524 


-1,24915 


4,50339 


-3,12422 


-0,10168 


0,32521 


0,07017 


0,00318 


25 


0,00228 


0,03306 


0,0228 


-0,5241 


-1,14105 


4,45186 


-3,23677 


-0,04445 


0,34515 


0,06912 


0,00195 


26 


0,00152 


0,02973 


0,01085 


-0,52184 


-1 ,03544 


4,39477 


-3,34704 


0,01668 


0,36471 


0,06745 


0,00054 


27 


0,00074 


0,02646 


-0,00024 


-0,51738 


-0,93249 


4,33226 


-3,45476 


0,08168 


0,38378 


0,06515 


-0,00105 


28 


-0,00003 


0,02328 


-0,01047 


-0,51085 


-0,83239 


4,26452 


-3,55963 


0,15053 


0,40224 


0,06216 


-0,00284 


29 


-0,0008 


0,02021 


-0,01983 


-0,50241 


-0,73529 


4,19174 


-3,66138 


0,22319 


0,41993 


0,05847 


-0,00481 


30 


-0,00156 


0,01726 


-0,02834 


-0,49221 


-0,64135 


4,11413 


-3,75974 


0,29961 


0,43674 


0,05403 


-0,00696 


31 


-0,00229 


0,01445 


-0,03599 


-0,4804 


-0,55069 


4,03188 


-3,85445 


0,37972 


0,45251 


0,04882 


-0,00929 
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9.8.4 GricM mapping tables 

Table 9-44: GricM mapping table 



High-resolution 
Grid subband 


Gridl subband number 





1 


2 


3 


4 


5 


6 


7 


8 


9 





1 





























1 





1 


























2 








1 























3 











1 




















4 














0,5 

















5 














0,5 

















6 

















0,5 














7 

















0,33333 


0,09524 











8 

















0,16667 


0,19048 











9 




















0,28571 











10 




















0,21429 


0,05556 








11 




















0,14286 


0,11111 








12 




















0,07143 


0,16667 








13 























0,22222 








14 























0,17778 


0,03636 





15 























0,13333 


0,07273 





16 























0,08889 


0,10909 





17 























0,04444 


0,14545 





18 


























0,18182 





19 


























0,15152 


0,02778 


20 


























0,12121 


0,05556 


21 


























0,09091 


0,08333 


22 


























0,06061 


0,11111 


23 


























0,03030 


0,13889 


24 





























0,16667 


25 





























0,13889 


26 





























0,11111 


27 





























0,08333 


28 





























0,05556 


29 





























0,02778 



9.8.5 Quantization Levels for Residuals 



Table 9-45: ResidualLevels16 



Table 9-46: ResidualLevels8 



Residual Code 


Dequantized Value 





-1,3125 


1 


-1,1375 


2 


-0,9625 


3 


-0,7875 


4 


-0,6125 


5 


-0,4375 


6 


-0,2625 


7 


-0,0875 


8 


0,0875 


9 


0,2625 


10 


0,4375 


11 


0,6125 


12 


0,7875 


13 


0,9625 


14 


1,1375 


15 


1,3125 



Residual Code 


Dequantized Value 





-1,0 


1 


-0,625 


2 


-0,291666667 


3 


0,0 


4 


0,25 


5 


0,5 


6 


0,75 


7 


1,0 


Table 9-49: Residual Levels5 


Residual Code 


Dequantized Value 





-0,875 


1 


-0,375 


2 





3 


0,375 


4 


0,875 



Table 9-47: ResidualLevels3 



Residual Code 


Dequantized Value 





-0,645 


1 





2 


0,645 


Table 9-48: 
ResidualQuantizednLeveMO 


Residual Code 


Dequantized Value 





-0,47 


1 


0,47 


Table 9-50: 
ResidualQuantizednLevel16 


Residual Code 


Dequantized Value 





-0,645 


1 


0,645 
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9.8.6 Long window for filterbank 

Only first 128 coefficients are tabulated since the rest of the table is symmetric. 

Table 9-51 : Long window for filterbank 



1 


0,00000073053931 


2 


0,00001971603402 


3 


0,00009120922576 


4 


0,00024999557063 


5 


0,00053053202896 


6 


0,00096681736288 


7 


0,00159226421635 


8 


0,00243957342599 


9 


0,00354061104012 


10 


0,00492628851495 


11 


0,00662644654387 


12 


0,00866974296265 


13 


0,01108354515765 


14 


0,01389382738765 


15 


0,01712507341094 


16 


0,02080018478972 


17 


0,02494039522219 


18 


0,02956519123018 


19 


0,03469223950626 


20 


0,04033732119902 


21 


0,04651427338908 


22 


0,05323493798151 


23 


0,06050911821199 


24 


0,06834454293557 


25 


0,07674683883740 


26 


0,08571951067489 


27 


0,09526392963020 


28 


0,10537932982169 


29 


0,11606281299140 


30 


0,12730936135520 


31 


0,13911185857091 


32 


0,15146111874875 


33 


0,16434592339804 


34 


0,17775306617356 


35 


0,19166740525504 


36 


0,20607192316413 


37 


0,22094779379428 


38 


0,23627445640116 


39 


0,25202969627406 


40 


0,26818973178253 


41 


0,28472930746762 


42 


0,30162179282255 


43 


0,31883928638529 



44 


0,33635272474356 


45 


0,35413199603255 


46 


0,37214605748681 


47 


0,39036305659014 


48 


0,40875045535117 


49 


0,42727515721822 


50 


0,44590363613336 


51 


0,46460206721486 


52 


0,48333645854690 


53 


0,50207278354759 


54 


0,52077711337935 


55 


0,53941574886118 


56 


0,55795535133868 


57 


0,57636307196621 


58 


0,59460667885545 


59 


0,61265468154600 


60 


0,63047645225724 


61 


0,64804234338508 


62 


0,66532380071377 


63 


0,68229347182117 


64 


0,69892530916529 


65 


0,71519466735162 


66 


0,73107839409342 


67 


0,74655491439262 


68 


0,76160430748517 


69 


0,77620837611373 


70 


0,79035070771131 


71 


0,80401672710242 


72 


0,81719374035394 


73 


0,82987096943593 


74 


0,84203957738371 


75 


0,85369268368674 


76 


0,86482536966764 


77 


0,87543467365609 


78 


0,88551957580828 


79 


0,89508097247291 


80 


0,90412164005964 


81 


0,91264618842693 


82 


0,92066100387146 


83 


0,92817418187376 


84 


0,93519544983175 


85 


0,94173608009759 


86 


0,94780879372230 



87 


0,95342765540680 


88 


0,95860796025748 


89 


0,96336611304655 


90 


0,96771950078316 


91 


0,97168635950719 


92 


0,97528563632273 


93 


0,97853684779033 


94 


0,98145993589285 


95 


0,98407512287726 


96 


0,98640276634975 


97 


0,98846321606187 


98 


0,99027667386693 


99 


0,99186305834592 


100 


0,99324187559754 


101 


0,99443209765490 


102 


0,99545204993079 


103 


0,99631930900172 


104 


0,99705061191921 


105 


0,99766177808460 


106 


0,99816764454334 


107 


0,99858201534882 


108 


0,99891762541818 


109 


0,99918611905815 


110 


0,99939804308295 


111 


0,99956285418553 


112 


0,99968893996391 


113 


0,99978365275329 


114 


0,99985335517798 


115 


0,99990347612183 


116 


0,99993857562689 


117 


0,99996241707224 


118 


0,99997804486209 


119 


0,9999878657671 1 


120 


0,99999373201709 


121 


0,99999702423632 


122 


0,99999873234653 


123 


0,99999953263198 


124 


0,99999985926787 


125 


0,99999996875111 


126 


0,99999999584044 


127 


0,99999999980564 


128 


0,99999999999973 
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9.8.7 Delta Index for LFE ADPCM 



Table 9-52: Delta Index for 16-bit samples 



Table 9-53: Delta Index for 24-bit samples 



Code 


Index Adjustment 





-4 


1 


-3 


2 


-2 


3 


-1 


4 


2 


5 


4 


6 


6 


7 


8 



Code 


Index Adjustment 





-8 


1 


-8 


2 


-7 


3 


-7 


4 


-6 


5 


-6 


6 


-5 


7 


-5 


8 


-4 


9 


-4 


10 


-3 


11 


-3 


12 


-2 


13 


-2 


14 


-1 


15 


-1 


16 


1 


17 


1 


18 


2 


19 


2 


20 


3 


21 


3 


22 


4 


23 


4 


24 


5 


25 


5 


26 


6 


27 


6 


28 


7 


29 


7 


30 


8 


31 


8 
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9.8.8 Step Size for LFE ADPCM encoding 

Table 9-54: StepSize table for 16-bit samples 






2,1 3629566331 98035e-004 


1 


2,441 4807580797754e-004 


2 


2,7466658528397473e-004 


3 


2,7466658528397473e-004 


4 


3,051 85094759971 92e-004 


5 


3,357036042359691 1 e-004 


6 


3,967406231 8796350e-004 


7 


4,272591 3266396069e-004 


8 


4,577776421 3995788e-004 


9 


5,1 881 4661 091 95227e-004 


10 


5,79851 68004394665e-004 


11 


6,1 03701 8951 994385e-004 


12 


6,71 407208471 93823e-004 


13 


7,6296273689992981 e-004 


14 


8,23999755851 9241 9e-004 


15 


9,1 555528427991 577e-004 


16 


1 ,0071 1 081 27079073e-003 


17 


1 ,098666341 1 358989e-003 


18 


1 ,2207403790398877e-003 


19 


1 ,34281 441 69438765e-003 


20 


1 ,4648884548478652e-003 


21 


1 ,61 7481 002227851 2e-003 


22 


1 ,7700735496078372e-003 


23 


1 ,9531 846064638203e-003 


24 


2,1 3629566331 98035e-003 


25 


2,349925229651 7838e-003 


26 


2,594073305459761 3e-003 


27 


2,8687398907437361 e-003 


28 


3,1 4340647602771 08e-003 


29 


3,448591 5707876827e-003 


30 


3,7842951 75023651 8e-003 


31 


4,181035798211 61 53e-003 


32 


4,6082949308755760e-003 


33 


5,06607257301 55339e-003 



34 


5,554368724631 4890e-003 


35 


6, 103701 8951 994385e-003 


36 


6,7445905941 953795e-003 


37 


7,41 599780266731 77e-003 


38 


8,1 48442030091 251 2e-003 


39 


8,941 9232764671 782e-003 


40 


9,8574785607470940e-003 


41 


1 ,0834070863979004e-002 


42 


1 , 1 9327372051 1 4903e-002 


43 


1 ,31 22959074678793e-002 


44 


1 ,44352549821 46673e-002 


45 


1 ,586962492751 8540e-002 


46 


1 ,7456587420270394e-002 


47 


1 ,91 961 42460402233e-002 


48 


2,1 1 18808557390057e-002 


49 


2,322458571 1233862e-002 


50 


2,5543992431 409649e-002 


51 


2,81 0754722739341 3e-002 


52 


3,091 52500991 851 55e-002 


53 


3,40281 38065736867e-002 


54 


3,741 569261 7572556e-002 


55 


4,1 1 694692831 2021 5e-002 


56 


4,5258949552903834e-002 


57 


4,980620746482741 8e-002 


58 


5,478072450941 4958e-002 


59 


6,027405621 5094456e-002 


60 


6,6286202581 865905e-002 


61 


7,290871 91 381 57288e-002 


62 


8,020264290292061 8e-002 


63 


8,822901 08951 07887e-002 


64 


9,70488601 33671 075e-002 


65 


1 ,067537461 470381 8e-001 


66 


1,1743522446363720e-001 


67 


1 ,291 8485061 1 8961 1 e-001 



68 


1 ,420941 801 2024294e-001 


69 


1, 56285287026581 62e-001 


70 


1,71 91 07638782921 8e-001 


71 


1 ,891 2320322275461 e-001 


72 


2,0804467909787286e-001 


73 


2,28827784051 02694e-001 


74 


2,51 71 66661 5802485e-001 


75 


2,7689443647572254e-001 


76 


3,04574724570451 98e-001 


77 


3,350321 9702749720e-001 


78 


3,68541 5204321 421 1 e-001 


79 


4,05377361 36967073e-001 


80 


4,4593646046327096e-001 


81 


4,9052400280770286e-001 


82 


5,3956724753563035e-001 


83 


5,935239722891 9340e-001 


84 


6,5288247322000792e-001 


85 


7, 181 61 5649891 6595e-001 


86 


7,900021 3629566329e-001 


87 


8,6898403881 954400e-001 


88 


9,559007538071 8409e-001 


89 


1 ,0514847254860074e+000 


90 


1 , 1 5662099063081 76e+000 


91 


1 ,2722861 41 5448470e+000 


92 


1 ,39951 78075502792e+000 


93 


1 ,5394756920072024e+000 


94 


1,693411 0538041 323e+000 


95 


1 ,8627582628864405e+000 


96 


2,0490432447279274e+000 


97 


2,25394451 73497725e+000 


98 


2,47932370983001 20e+000 


99 


2,7272865993224893e+000 


100 


3,0000000000000000e+000 
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Table 9-55: StepSize table for 24-bit samples 



n 
u 


R R7R97Q1 1 9R4Q1 9Qflp-n0fi 

0,0/0£/yi I £04y 1 £\TOC UUD 


I 


o qrrqo70941 R4049Rp-nnR 
o,yojyu / u£4 \ o4U4£Oc uuo 


£ 


4 41 07449RQ1 R0RQR4p DDR 

4 ,4 i u / 44£oy i ouoyj^c uuo 


o 
O 


A 7RRR791 RriAP.RROP.AB OOfi 
4, / Duo / £ I OU^DDOUD'fU UUD 


A 
4 


R 94R90QRRRR1 90R70p OOfi 
0,£4 0£UyOOOO 1 £UO /Uc UUD 


O 


R R41 9RRRR4R9094RRp OOfi 
0,04 I £000040£U£400c UUD 


c 
O 


fi 4R7R0940R1 9R4RRfip DDR 
D,40 / OU£4UO I ^OfojDc UUD 


7 
/ 


7 0RRR4RQ91 QRfifi91 Qp-Onfi 
/ ,U00040y£ 1 yODO£ 1 ac UUD 


Q 
O 


7 74Rfi04744R0fi447Qp-0nfi 

/ , / 4DDU4 / 440U044 / 3c UUD 


y 


R AfiRRfiORR7n7fi97Rflp-nnfi 
O^DjODUjD / U / Oil / OOc UUD 


1 n 

I u 


Q 41 7RR4QQ71 RQR7R1 p-OOfi 
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9.8.9 Scaling coefficients lookup table 

Table 9-56: DMixScalingJndexTodB 






-200 dB 


1 


OdB 


2 


-0,5 dB 


3 


-1,0dB 


4 


-1,5 dB 


5 


-2,0 dB 


6 


-2,5 dB 


7 


-3,0 dB 



8 


-3,5 dB 


9 


-4,0 dB 


10 


-4,5 dB 


11 


-5,0 dB 


12 


-5,5 dB 


13 


-6,0 dB 


14 


-6,5 dB 


15 


-7,0 dB 



16 


-7,5 dB 


17 


-8,0 dB 


18 


-8,5 dB 


19 


-9,0 dB 


20 


-9,5 dB 


21 


-10,0 dB 


22 


-10,5 dB 


23 


-11,0dB 



24 


-11,5dB 


25 


-12,0 dB 


26 


-12,5 dB 


27 


-13,0 dB 


28 


-13,5 dB 


29 


-14,0 dB 


30 


-14,5 dB 


31 


-15,0 dB 



Table 9-57: DMixContribution IndexTodB 






-200 dB 
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OdB 
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-0,5 dB 
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-1,0dB 
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-1,5dB 
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-2,0 dB 
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-2,5 dB 


7 


-3,0 dB 


8 


-3,5 dB 


9 


-4,0 dB 


10 


-4,5 dB 


11 


-5,0 dB 


12 


-5,5 dB 


13 


-6,0 dB 


14 


-6,5 dB 


15 


-7,0 dB 



16 


-7,5 dB 


17 


-8,0 dB 


18 


-8,5 dB 


19 


-9,0 dB 


20 


-9,5 dB 


21 


-10,0 dB 


22 


-10,5 dB 


23 


-11,0 dB 


24 


-11,5 dB 


25 


-12,0 dB 


26 


-12,5 dB 


27 


-13,0 dB 


28 


-13,5 dB 


29 


-14,0 dB 


30 


-14,5 dB 


31 


-15,0 dB 



32 


-15,5dB 


33 


-16,0 dB 


34 


-16,5 dB 


35 


-17,0dB 


36 


-17,5dB 


37 


-18,0 dB 


38 


-18,5 dB 


39 


-19,0 dB 


40 


-19,5 dB 


41 


-20,0 dB 


42 


-21,0 dB 


43 


-22,0 dB 


44 


-23,0 dB 


45 


-24,0 dB 


46 


-25,0 dB 


47 


-26,0 dB 



48 


-27,0 dB 


49 


-28,0 dB 


50 


-29,0 dB 


51 


-30,0 dB 


52 


-31,0 dB 


53 


-32,0 dB 


54 


-33,0 dB 


55 


-34,0 dB 


56 


-35,0 dB 


57 


-36,0 dB 


58 


-37,0 dB 


59 


-38,0 dB 


60 


-39,0 dB 


61 


-40,0 dB 



9.8.10 Index Hopping Huffman Tables 

Table 9-58: Codebook for Tonal Groups 



Index 


prsDist [nGroup] 


tnlScf 





1 


2 


3 


4 


A 


B 


A 


B 


A 


B 


A 


B 


A 


B 


A 


B 





0x01 


0x00 


0x01 


0x00 


0x01 


0x00 


0x01 


0x00 


0x01 


0x00 


0x01 


0x00 


1 


0x01 


0x38 


0x01 


0x30 


0x01 


0x38 


0x01 


0x36 


0x01 


0x2C 


0x01 


0x1 A 


2 


0x01 


0x04 


0x01 


0x16 


0x01 


0x1 C 


0x01 


0x22 


0x01 


0x02 


0x01 


0x16 


3 


0x01 


0x02 


0x01 


0x04 


0x01 


0x18 


0x01 


OxOC 


OxFF 


0x02 


0x01 


0x02 


4 


OxFF 


0x05 


0x01 


0x02 


0x01 


0x16 


0x01 


OxOA 


0x01 


0x12 


OxFF 


0x03 


5 


OxFF 


0x04 


OxFF 


0x04 


0x01 


0x14 


0x01 


0x04 


0x01 


0x10 


0x01 


0x12 


6 


0x01 


0x26 


OxFF 


0x07 


0x01 


0x04 


0x01 


0x02 


0x01 


0x08 


0x01 


0x10 


7 


0x01 


0x24 


0x01 


0x02 


0x01 


0x02 


OxFF 


0x08 


0x01 


0x06 


0x01 


0x02 


8 


0x01 


0x22 


OxFF 


OxOA 


OxFF 


OxOE 


OxFF 


0x02 


0x01 


0x02 


OxFF 


OxOB 


9 


0x01 


0x20 


0x01 


OxOE 


OxFF 


0x11 


0x01 


0x02 


OxFF 


0x07 


0x01 


OxOC 


10 


0x01 


0x18 


0x01 


OxOC 


0x01 


0x02 


OxFF 


0x07 


0x01 


0x02 


0x01 


OxOA 


11 


0x01 


0x02 


0x01 


OxOA 


OxFF 


OxOF 


0x01 


0x02 


OxFF 


OxOF 


0x01 


0x02 


12 


OxFF 


0x13 


0x01 


0x08 


0x01 


OxOC 


OxFF 


0x17 


OxFF 


0x15 


OxFF 


0x10 


13 


0x01 


0x14 


0x01 


0x04 


0x01 


0x02 


OxFF 


OxOC 


OxFF 


0x03 


0x01 


0x02 


14 


0x01 


0x06 


0x01 


0x02 


OxFF 


0x17 


OxFF 


0x05 


0x01 


0x02 


OxFF 


0x11 


15 


0x01 


0x04 


OxFF 


0x03 


0x01 


0x02 


0x01 


0x10 


OxFF 


0x06 


0x01 


0x02 


16 


0x01 


0x02 


OxFF 


0x1 B 


OxFF 


0x1 C 


0x01 


OxOE 


0x01 


0x02 


OxFF 


0x12 


17 


OxFF 


0x21 


0x01 


0x02 


0x01 


0x02 


0x01 


0x02 


OxFF 


OxOD 


0x01 


0x02 


18 


OxFF 


0x1 F 


OxFF 


0x1 D 


OxFF 


0x1 D 


OxFF 


OxOA 


0x01 


0x02 


OxFF 


0x13 


19 


OxFF 


0x1 C 


OxFF 


0x1 C 


0x01 


0x04 


0x01 


OxOA 


OxFF 


OxOE 


OxFF 


0x00 


20 


0x01 


OxOC 


OxFF 


0x16 


0x01 


0x02 


0x01 


0x02 


OxFF 


0x12 


OxFF 


OxOF 


21 


0x01 


OxOA 


OxFF 


0x15 


OxFF 


0x1 E 


OxFF 


0x14 


OxFF 


0x04 


OxFF 


OxOE 



ETSI 



191 



ETSI TS 102 114 V1. 4.1 (2012-09) 



Index 


prsDist [nGroup] 


tnlScf 





1 


2 


3 


4 


A 
M 


R 


A 
M 


D 


A 
M 


D 


A 
M 


p 


A 
M 


p 


A 


p 

D 


OO 
dd 


0x01 


0x04 


OxFF 


0x0 F 


OxFF 


0x00 


0x01 


0x02 


0x01 


0x08 


OxFF 


0x09 


do 


0x01 


0x02 


OxFF 


0x0 E 


OxFF 


0x1 F 


OxFF 


0x19 


0x01 


0x02 


OxFF 


0x07 


OA 

dQ 


OxFF 


0x22 


0x01 


0x18 


OxFF 


0x19 


0x01 


0x02 


OxFF 


0x05 


0x01 


0x02 


o^ 
do 


OxFF 


0x25 


0x01 


0x02 


OxFF 


OxOA 


OxFF 


0x1 A 


0x01 


0x04 


OxFF 


0x02 


do 


0x01 


0x04 


OxFF 


0x08 


OxFF 


0x08 


0x01 


0x02 


0x01 


0x02 


OxFF 


0x04 


07 

d I 


0x01 


0x02 


0x01 


0x14 


0x01 


0x02 


OxFF 


0x1 B 


OxFF 


OxOB 


0x01 


0x04 


OQ 

do 


OxFF 


0x23 


0x01 


0x02 


OxFF 


0x09 


OxFF 


0x00 


OxFF 


OxOA 


0x01 


0x02 


d\d 


OxFF 


0x00 


OxFF 


0x10 


OxFF 


0x04 


OxFF 


0x16 


OxFF 


0x14 


OxFF 


0x01 


oU 


OxFF 


0x24 


0x01 


0x02 


0x01 


0x1 A 


OxFF 


0x09 


0x01 


OxOC 


OxFF 


0x05 


o\ 


OxFF 


0x20 


OxFF 


0x13 


0x01 


0x0 E 


0x01 


0x04 


0x01 


OxOA 


0x01 


0x08 


oo 

0£1 


Oxrr 


0x1 E 


A*r A ^ 

0x01 


AwAO 

0x02 


0x01 


OxOC 


AwA ^ 

0x01 


0x02 


0x01 


AwAO 

0x08 


0x01 


0x06 


OO 
OO 


OxFF 


0x18 


A, rl^CT 

OxFF 


0x17 


0x01 


OxOA 


OxFF 


OxOD 


0x01 


0x02 


0x01 


0x04 


04 


0x01 


0x02 


0x01 


0x02 


0x01 


0x04 


OxFF 


0x1 1 


OxFF 


OxOC 


0x01 


0x02 


0^ 


OxFF 


0x16 


OxFF 


0x1 A 


0x01 


0x02 


OxFF 


0x04 


0x01 


0x02 


OxFF 


OxOC 


0£ 
OO 


0x01 


0x02 


0x01 


0x02 


UXrr 


0x16 


0x01 


0x1 2 


Aw^^ 

Oxrr 


0x10 


OxFF 


OxOD 


07 
O / 


Oxrr 


0x1 7 


Oxrr 


0x1 E 


UXrr 


0x03 


0x01 


OxOA 


0x01 


0x02 


OxFF 


OxOA 


OQ 
OO 


AwA ^ 
0X01 


AwAO 

0X02 


AwA ^ 

UXU1 


AwAO 

OxOo 


AwA H 

0x01 


AwAO 

0X02 


AwA ^ 

0X01 


AwAC 

0x0b 


OxFF 


0x16 


OxFF 


0x08 


Oa 


UXrr 


A\^ r~. 
UX1 U 


AwA ^ 

UXU1 


UxUb 


UXrr 


A\^ C 

0X1 O 


AwA ^ 

0X01 


AwAO 

0X02 


OxFF 


0x00 


OxFF 


0x06 


An 

4U 


Oxrr 


0X1 D 


AwA ^ 

0x01 


AwAO 

0x02 


AwA H 

0x01 


0x02 


Aw^C 

Oxrr 


AwA d 

0x0 1 


OxFF 


0x11 




A1 

4 I 


Oxrr 


0x1 1 


Oxrr 


AwO ^ 

0x21 


Oxrr 


0x1 A 


AwA ^ 

0x01 


AwAO 

0x02 


OxFF 


0x13 


AO 


OxFF 


0x0 E 


0x01 


0x02 


OxFF 


0x1 B 


OxFF 


0x1 3 


0x01 


0x02 


AO 
40 


OxFF 


0x07 


OxFF 


0x22 


OxFF 


OxOC 


OxFF 


0x1 8 


OxFF 


0x08 


AA 

44 


0x01 


OxOC 


OxFF 


0x00 


OxFF 


OxOB 


0x01 


0x02 


OxFF 


0x09 


A^ 
40 


AwA ^ 

0x01 


0x02 


Oxrr 


AwOA 

0x20 


AwA H 

0x01 


AwA A 

OxOA 


Aw^^ 

Oxrr 


AwAO 

0x03 


OxFF 


0x01 


4b 


OxFF 


OxOC 


OxFF 


0x1 F 


0x01 


0x04 


OxFF 


OxOB 






A 7 


0x01 


0x02 


OxFF 


OxOC 


0x01 


0x02 


0x01 


0x04 




yl Q 
4o 


PlvFF 

UXrr 


PlvPH 
UXU I 


AvPP 

uxr r 


AvA£ 
UXUO 


uxr r 


Av1 A 
UX I U 


0x01 


0x02 


AO 

4y 


PlvPH 
UXU 1 


PlvPl£ 

uxuo 


AvA1 
UXU I 


AvAP 
UXUtZ 


AvPP 

uxr r 


Av1 O 

UX I £L 


OxFF 


0x15 


50 


PlvPH 
UXU 1 


PlvPM 
UXU4 


AvA1 
UXU I 


AvAO 
UXU£ 


AvA1 
UXU I 


AvA/1 
UXU4 


OxFF 


0x12 


51 


PlvPH 
UXU 1 


PlvPlO 
UXU*i 


AvPP 
UXr r 


AvAQ 

uxuy 


AvAl 
UXU 1 


AvAO 
UXU£ 


0x01 


0x02 


52 


PlvFF 
UXr r 


UX I M 


AvA1 
UXU 1 


AvAO 
UXU*i 


uxr r 


Av1 A 
UX I °r 


OxFF 


0x10 


53 


OyFF 

UXr r 


PIyPIO 

uxuo 


AvPP 

UXrr 


AyA1 
UXU I 


AvPP 
uxr r 


Ay1 P. 
UX I O 


OxFF 


0x0 F 


54 


OyFF 

UAr r 


P»y1 Q 
ux i y 


UXU 1 


AyAS 
uxuo 


OxFF 


0x13 


OxFF 


0x06 


55 


OyFF 

uArr 


Oy1 A 


AyA1 

UAU 1 


AyAK 
uxuo 


OxFF 


OxOD 


OxFF 


0x01 


JD 


PlvFF 

UXrr 


PlvPlQ 
UXUO 


AvA1 
UXU I 


AvAO 
UXU^i 


OxFF 


0x05 








0x01 


0x0 E 


OxFF 


0x14 


0x01 


0x02 




f^Q 


0x01 


0x04 


0x01 


0x02 


OxFF 


0x01 




AwA ^ 

0x01 


0x02 


Air IT IT 

Oxrr 


0x19 


0x01 


0x02 


ou 


UXrr 


A V A A 

UXUA 


OxFF 


0x18 


OxFF 


0x06 


O I 


UXrr 


PlvPlR 


OxFF 


0x12 


OxFF 


0x07 


RO 


0x01 


0x08 


OxFF 


0x11 






£0 
OO 


0x01 


0x04 


0x01 


0x02 




£A 
04 


AwA ^ 

0x01 


0x02 


OxFF 


0x06 


OO 


PlvCC 
UXrr 


PlvPlC 

UXUr 


0x01 


0x02 


OO 


OxFF 


0x10 


OxFF 


OxOB 


fi7 


0x01 


0x02 


OxFF 


OxOD 


68 


OxFF 


0x12 






69 


OxFF 


0x15 




70 


OxFF 


OxOB 


71 


0x01 


0x02 


72 


OxFF 


0x09 


73 


OxFF 


0x06 
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Table 9-59: Codebook for Amplitude and Phase 



Index 


rsdAmp 


fstRsdAmp 


dPH 


dAmp 


rsdAp 




A 


R 


A 


R 


A 


R 


A 
n 


R 


A 


B 


n 

u 


n v ni 

UXU I 


uxuu 


UXU I 


uxuu 


UXU I 


rivnn 
UXUU 


UXU I 


rivnn 
UXUU 


flYfll 
UAU 1 


UAUU 


I 


UXU I 


UX^iU 


UXU I 


UX*:0 


UXU I 


uxu^# 


UXU I 


flvflP 
UXUO 


flYfll 
UAU 1 


OyOP 

UAUl 


2 


(i v ni 

UXU I 


UaU*+ 


rwni 

UXU I 


Ua I O 


(Iv(l1 
UXU I 


DY09 
uxu^_ 


flYfll 
UAU 1 


flYfl9 
UAUil 


OxFF 


UAU 1 




UaU I 


UAU^I 


rwni 

UXU I 


UAUH- 


DyFF 

UAr r 


DyD9 
UXU^l 


DyFF 

UAr n 


flYfl9 


flYfll 
UAU 1 


OyO? 


4 


DyFF 


UAUt 


UAU 1 


OyDP 

UAUl 


DyDI 

UAU 1 


DYD9 


flYfll 
UAU 1 


flYfl9 


OxFF 


0x02 


5 


OxFF 


0x01 


OxFF 


OxOC 


OxFF 


0x03 


OxFF 


0x03 


0x01 


0x02 


6 


0x01 


0x02 


OxFF 


0x1 1 


0x01 


0x02 


0x01 


0x02 


OxFF 


0x03 


7 


OxFF 


0x05 


0x01 


0x06 


OxFF 


0x04 


OxFF 


0x04 


0x01 


0x02 


8 


0x01 


0x1 8 


0x01 


0x04 


0x01 


0x02 


0x01 


0x02 


OxFF 


0x04 


9 


0x01 


0x16 


0x01 


0x02 


OxFF 


0x05 


OxFF 


0x05 


0x01 


0x02 


10 


0x01 


0x12 


OxFF 


0x01 


0x01 


0x02 


0x01 


0x02 


OxFF 


0x05 


11 


0x01 


OxOC 


OxFF 


0x08 


OxFF 


0x06 


OxFF 


0x06 


OxFF 


0x00 


12 


UAU 1 


OyO? 


OxFF 


UAU Zf 


OxFF 


0x00 


OxFF 


0x00 




13 


OxFF 


DyOF 

UAU 1— 


UAU 1 


UAUU 


OxFF 


0x01 


OxFF 


0x01 


14 


UAU 1 


DyOP 

UAUt 


UAU 1 


UAUH- 


OxFF 


0x05 






15 


DyFF 
UAr i 


Dy1 A 

UA I *+ 


UAU 1 


OyOP 

UAUl 


OxFF 


0x00 




16 


DyDI 

UAU 1 


(IyDP 

UAUt 


DyFF 

UAr i 


Oy1 4 

UA I *+ 


OxFF 


0x04 


17 


OyFF 
uat r 


Oy1 A 

UA I M 


OyFF 
uat r 


UXUO 


OxFF 


0x08 


I o 


UXU 1 


UXU4 


HvFF 

uxr r 


UXUO 








UXU 1 


UXU*_ 


UXU 1 


UXU*_ 




£.U 


uxr r 


ux i y 


uxr r 


UXUD 


91 
£ i 


uxr r 


UX^iU 


UXU 1 


UXU*: 


9? 


uxr r 


UX I o 


n v FF 

uxr r 


UXU*: 


9T 
*io 


UXU 1 


UXU^i 


UXU 1 


UXUD 




uxr r 


Oy1 n 

UX I u 


UXU 1 


0y09 
UXU*. 


9<i 


UXU 1 


uxu^ 


uxr r 


Oy1 fi 
UX I 


9fi 


uxr r 


UX 1 o 


UXU 1 


Ovoo 

UXU*: 


97 


uxr r 


UX I I 


flvFF 

uxr r 


Ovl 7 
UX I / 


98 


UXU 1 


UXU*: 


HvFF 

uxr r 


Ovoo 
uxuu 


9Q 


OvFF 
uxr r 


UXUL/ 


uxr r 


Oy1 fi. 
UX I 




uxr r 


uxuu 


UXU I 


OyOPI 
UXUO 


T1 


uxr r 


uxuy 


UXU I 


0v09 
UXU*: 


"39 
o£ 


uxr r 


O.vO.7 
UXU / 


rivFF 

uxr r 


UXUD 


oo 


UXU 1 


UXU*: 


UXU I 


0v09 
UXU*: 


ot 


uxr r 


UXUo 


uxr r 


Oy1 ^ 
UX I o 




UXU 1 


uxu*_ 


UXU I 


uxu^ 


oo 


uxr r 


UXU4 


rivFF 

uxr r 


UXU / 


^7 
o / 


UXU 1 


n v -j p 
UX I L/ 


uxr r 


UXU4 


TQ 
OO 


UaU 1 


UaU*+ 


OyFF 
uat r 


Oy1 n 

Ua I U 


39 


0x01 


0x02 


0x01 


0x06 


40 


OxFF 


0x08 


0x01 


0x04 


41 


OxFF 


OxOB 


0x01 


0x02 


42 


0x01 


0x16 


OxFF 


OxOA 


43 


0x01 


0x04 


OxFF 


0x12 


44 


0x01 


0x02 


OxFF 


0x0 F 


45 


OxFF 


0x12 


0x01 


0x02 


46 


OxFF 


0x0 F 


OxFF 


0x0 D 


47 


AwA h 

0x01 


0x1 


OxFF 


0x0 E 


48 


UXU1 


UXUO 






49 


UXU1 


UXUA 




50 


UXU1 


UXU*: 


51 


uxr r 


UX I c 


52 


0x01 


0x04 


53 


0x01 


0x02 


54 


OxFF 


0x24 


55 


OxFF 


0x22 


56 


0x01 


0x02 


57 


OxFF 


0x1 D 


58 


OxFF 


0x00 


59 


OxFF 


0x15 
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Index 


rsdAmp 


fstRsdAmp 


dPH 


dAmp 


rsdAppx 


A 


B 


A | B 


A | B 


A j B 


A ! B 


60 


0x01 


0x02 




61 


OxFF 


0x1 C 


62 


OxFF 


0x17 


63 


OxFF 


0x16 


64 


OxFF 


OxOA 


65 


OxFF 


0x06 



Table 9-60: Code book for Grid Reconstruction 



inoGx 


stGrid 


avgG3 


A 


B 


A 


A 





0x01 


0x00 


0x01 


0x00 


1 


0x01 


0x2A 


0x01 


0x20 


2 


0x01 


0x28 


0x01 


0x1 E 


3 


0x01 


0x04 


0x01 


OxOA 


4 


0x01 


0x02 


0x01 


0x02 


5 


OxFF 


0x04 


OxFF 


0x0 E 


6 


OxFF 


0x03 


0x01 


0x06 


7 


0x01 


0x02 


0x01 


0x02 


8 


OxFF 


0x08 


OxFF 


OxOB 


9 


0x01 


0x20 


0x01 


0x02 


10 


0x01 


OxOA 


OxFF 


0x13 


11 


0x01 


0x08 


OxFF 


0x09 


12 


0x01 


0x02 


OxFF 


0x0 D 


13 


OxFF 


0x0 E 


0x01 


0x12 


14 


0x01 


0x02 


0x01 


0x10 


15 


OxFF 


0x07 


0x01 


0x02 


16 


0x01 


0x02 


OxFF 


OxOA 


17 


OxFF 


0x09 


0x01 


0x04 


18 


OxFF 


0x16 


0x01 


0x02 


19 


OxFF 


OxOC 


OxFF 


0x14 


20 


0x01 


0x14 


OxFF 


0x08 


21 


0x01 


0x0 E 


0x01 


0x08 


22 


0x01 


0x02 


0x01 


0x06 


23 


OxFF 


0x10 


0x01 


0x02 


24 


0x01 


0x02 


OxFF 


0x06 


25 


OxFF 


OxOB 


0x01 


0x02 


26 


0x01 


0x02 


OxFF 


0x17 


27 


OxFF 


0x0 D 


OxFF 


0x00 


28 


0x01 


0x04 


OxFF 


0x15 


29 


0x01 


0x02 


OxFF 


0x07 


30 


OxFF 


0x11 


OxFF 


OxOC 


31 


OxFF 


0x0 F 


OxFF 


0x12 


32 


0x01 


0x02 


OxFF 


0x10 


33 


OxFF 


0x18 


0x01 


0x02 


34 


OxFF 


0x00 


OxFF 


0x0 F 


35 


0x01 


0x02 


OxFF 


0x11 


36 


OxFF 


0x12 




37 


0x01 


0x02 


38 


OxFF 


0x14 


39 


OxFF 


0x02 


40 


OxFF 


0x05 


41 


OxFF 


OxOA 


42 


OxFF 


0x01 


43 


OxFF 


0x06 
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Annex B (normative): 
CRC Algorithm 

A number of CRC algorithms are in use throughout the world. This Annex described the CRC method used by all of the 
DTS algorithms. This algorithm of CRC is also known as CRC-CCITT. 

The CRC is used for two purposes: to verify the correctness of header data and to greatly reduce the probability of false- 
alias sync-word detection. When flagged, a CRC checksum is added to each packet within the stream. CRCs are used to 
verify the data of various components within the frame. It is important to verify the header information, since the 
mechanism for traversing through the frame is by reading the size of a component from the NAVI table and locating the 
start position of the component based on its value. 

The CRC 16 polynomial is: 

G(x) = x 16 + x 12 +x 5 +1 
The CRC16 is initialized to the value of OxFFFF before checksum computation commences. 
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Annex C (informative): 
Example Pseudocode 

This annex provides detail on various processing algorithms that will assist the reader by providing a top level view of 
critical processes, as well as examples to aid in correctly interpreting the bitstream. 



C.1 Overview of main function calls 

This clause outlines in detail pseudocode examples to clarify the details of the main function calls, unpacking of the 
frame and primary audio coding headers. 

Based on this subframe structure, the procedure of decoding a subframe may be illustrated by the following 
pseudocode: 

DecodeSubf rame { ) { 

// Unpack Side Information. 
UnpackSidelnf ormation { ) ; 

// Inverse VQ to extract high frequency subbands . 

for (nChannel=0 ; nChannel<nNumPrimaryChannels ; nChannel++) { 

for (nSubband=nHFreqVQBegin; nSubband<nHFreqVQEnd; nSubband++) { 
VQIndex = ExtractVQIndex ( ) ; 

InverseVQ (VQIndex) ; // One index looks up 32 samples in one subband analysis window. 

} } 

// Unpack the LFE channel 

ExtractLFEDecimatedSamples { ) ; // Extract the decimated samples. 
InterpolateLFESamples ( ) ; // Interpolate for all LFE samples. 

// Unpack subsubf rames . 

for (nSubsubf rame=0 ; nSubsubf rame<nNumOf Subsubf rames ; nSubsubf rame++ ) { 
UnpackSubsubf rame { ) ; 

} 

// Reconstruct all primary channels through filter bank interpolation 
for (nChannel=0 ; nChannel<nNumPrimaryChannels ; nChannel++) { 
ReconstructChannel ( ) ; 

}} 

A subsubframe consists of eight subband samples (a subband analysis subwindow) for each subband of all primary 
channels, so its decoding procedure may be described as: 

UnpackSubsubf rame { ) { 

for (nChannel=0 ; nChannel<nNumPrimaryChannels ; nChannel++) 
for (nSubband=0 ; nSubband<nHFreqVQBegin; nSubband++) 
UnpackOneSubwindow ( ) ; // Get 8 subband samples. 

} 

An example of synchronization and decoding procedure may be described as follows: 

START_SYNC: InSyncFlag =0; // Search for extend sync word (38-bit sync word + 

extension) 

SearchForExtSync ( ) ; 

// Search for another sync word (32-bit sync word) 
SearchForSync ( ) ; 

// Count the distance between the two sync words and check if it is within the 
// limits. The next sync word is expected at this distance. 
InSyncFlag = CountSyncDist ( ) ; 
if (InSyncFlag==l) 

DecodeOneFrame ( ) ; // Decode the received frame 

else 

Goto START_SYNC; // Decode the remaining frames 

while (NotEndOf BitStream) { // Check if sync word occurred at the expected interval 

InSyncFlag = CheckSync ( ) ; 
if (InSyncFlag==l) 

DecodeOneFrame ( ) ; 

else 
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Goto START SYNC; 



C.2 Decoding Algorithms 

This clause outlines the decoding routines utilized by Coherent Acoustics. 



C.2.1 Block Code 

We will present two versions of the block code decoder based on: 

• the table look-up method; 

• the arithmetic method that requires one modulus division and one integer division per one decoded 
quantization index. 

The table look-up based decoding of a block code may be best illustrated by an example. Suppose a code of 64 is 
received as a three level block code. This code can be decoded as follows: 



1st Element: 64 = 


= 3x21 + 1; 


so 


quantization index = 





2nd Element: 21 


= 3x7 + 0; 


so 


quantization index = 


-1 


3rd Element: 7 = 


3x2+1; 


so 


quantization index = 





4th Element: 2 = 


3x0 + 2; 


so 


quantization index = 


+1 



where the quantization indexes are obtained by using the residuals to look up the quantization index table [-1,0, 1]. In 
summary, the quantization indexes of the four samples are (0, -1, 0, +1). 

The same code can be decoded using the code book of table V.3 in clause D.6.1. In order to facilitate the decoding 
process, this table is rearranged to give Table C.l. Then this code of 64 is decoded as follows: 

4th Element: 64 - 54 = 10 > 0; so quantization index = +1 

3rd Element: 10-9=1>0; so quantization index = 

2nd Element: 1 - = 1 > 0; so quantization index = -1 

1st Element: 1 - 1 = > 0; so quantization index = 

Therefore, the quantization indexes of the four samples are (0, -1, 0, +1). A general decoding procedure is given in the 
following pseudocode, assuming that the block codes in clause D.6 are rearranged as in Table C-l. 

Table C-1 : 3-level 4-element 7-bit Block Code Book 





Quantization Level index 


-1 





+0 


Code 


1st Element 





1 


2 


For 


2nd Element 3 6 





3 


6 




3rd Element 9 18 





9 


18 




4th Element 27 54 





27 


54 



int DecodeBlockCode ( int nCode, int *pnValue) { 
// nCode : Input code to be decoded. 

// nNumElement : Number of elements (samples) encoded 

// in a block. 

// nNumLevel: Number of quantization levels. 

// *pnValue: Array of decoded sample values. 

// *pnTable : Pointer to the code book. The code book is 

// organized as an array, each row of which contains 

// the code book for a particular element (sample) . 

pnValue += 3 ; 

nOffset = (nNumLevel-1) /2 ; 

int *pnEntry; // Pointer to the entries in the code book. 
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for (int n=nNumElement ; n>0; n--) { 

pnEntry = pnTable + n*nNumLevel ; // Point to the last entry 

// in the code book, 
for (int m=0; m<nNumLevel; m++) { 
pnEntry- - ; 

if { nCode >= *pnEntry ) { 
nCode -= *pnEntry; 

*pnValue = nOffset-m; // quantization index is calculated, 
if { nCode<0 ) { 

printf { "ERROR : block code look-up fail.\n"); 

return NULL; 

} 

break; 

} 

} 

pnValue- - ; 

} 

// Check if look-up successful 
if { nCode == ) 

return 1 ; 
else { 

printf { "ERROR : block code lock-up fail.\n"); 
return NULL; 




Very compact version of the block code decoder that does not use table look-up can be obtained using the modulus and 
integer division. The pseudocode that implements this version of the decoder is listed below. 

int DecodeBlockCode { int nCode, int *pnValue) { 
// nCode : Input code to be decoded. 
// nNumElement : Number of elements (samples) encoded in a block. 

// nNumLevel: Number of quantization levels. 
// *pnValue: Array of decoded sample values. 
nOffset = (nNumLevel- 1 ) >>1 ; 
for (int n=0; n< nNumElement; n++) { 

pnValue [n] = (nCode % nNumLevel) - nOffset; 
nCode /= nNumLevel; 

} 

if ( nCode == ) 

return 1; 
else { 

printf ( "ERROR : block code lock-up fail.\n"); 
return NULL; 




C.2.2 Inverse ADPCM 

Inverse ADPCM process is executed for each sample in a subband whose PMODE=l: 

void InverseADPCM (void) { 

// NumADPCMCoef f =4, the number of ADPCM coefficients. 
// raADPCMcoef f [] are the ADPCM coefficients extracted 
// from the bit stream. 

// raSample [NumADPCMCoef f] , raSample[-l] are the 

// history from last subframe or subsubframe. It is 
// updated each time before reverse ADPCM is run for a 
// block of samples for each subband. 
for (m=0; m<nNumSample ; m++) 

for (n=0; n<NumADPCMCoef f ; n++) 

raSample [m] += raADPCMcoef f [n] *raSample [m-n- 1] ; 

} 



C.2.3 Joint Subband Coding 

for (ch=0; ch<nPCHS; ch++) 

if ( JOINX[ch]>0 ){ // Joint subband coding enabled. 

nSourceCh = J0INX[ch]-l; // Get source channel. JOINX counts 

// channels as 1,2,3,4,5, so minus 1. 
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for (n=nSUBS [ch] ; n<nSUBS [nSourceCh] ; n++) 
for (nSample=0; n<8*nSSC; nSample++) 

aPrmCh[ch] .aSubband[n] . aSample [nSample] = JOIN_SCALES [ch] [n] * 
aPrmCh [nSourceCh] .aSubband[n] . aSample [nSample] ; 
} 

} 

C.2.4 Sum/Difference Decoding 

If flag SUMF is set, the front left and right channels are sum/difference encoded and are therefore appropriately 
decoded to produce the correct signals for the front left and right channels. Decoding is achieved by operating on the 
reconstructed subband samples: 

for (n=0; n<nSUBS; n++) // All active subbands . 

for (nSample=0; nSample<8*nSSC; nSample++) { // Samples in all subsubframes 
FrontLef t [nSample] = Fief t [nSample] + Fright [nSample] ; 
Frontright [nSample] = Fief t [nSample] - Fright [nSample] ; 

} 

This decoding is also required when AMODE = 3. 

Similarly when SUMS is set the reconstructed subband samples of the Left and right surround channels are decoded as: 

for (n=0; n<nSUBS; n++) // All active subbands. 

for (nSample=0; nSample<8*nSSC; nSample++) { // Samples in all subsubframes 
SurroundLef t [nSample] = Slef t [nSample] + Sright [nSample] ; 
Surroundright [nSample] = Slef t [nSample] - Sright [nSample] ; 

} 



C.2.5 Filter Bank Reconstruction 

Having prepared all the subband samples, it is time to go through subband interpolation to reconstruct the PCM samples 
for each primary channel. As discussed before, there are two filter banks, one for perfect reconstruction and the other 
for non-perfect. The encoder indicates its choice to the decoder through the FILTS flag in the frame header. 

for (ch=0; ch<nPCHS; ch++) 

aPrmCh[ch] . QMFInterpolation (FILTS , nSUBS [ch] ) ; 

// FILTS indicates which filter bank to use 

// nSUBS [ch] indicates the number of active subbands. Subbands 
// above it are all zeros. For joint intensity coded subbands, 
// it is set to that of the source channel, in order to 
// reflect the true subband activity. 

There are many methods to efficiently implement the reconstruction filter bank. We present only one of them which we 
think is fairly efficient. The two sets of 512 FIR coefficients are tabulated in clause D.8 to include both perfect 
reconstruction and nonperfect reconstruction and the selection is flagged by FILTS in the frame header. 

The first step is to pre -calculate the cosine modulation coefficients: 

PreCalCosMod ( ) { 

for <j=0,k=0;k<16;k++) 

for (i=0 ; i<16 ; i++) 

raCosMod[j+ + ] = (real ) cos ( (2*i + l )* <2*k+l ) *Pi/64 ) ; 
for (k=0;k<16;k++) 

for (i=0 ; i<16 ; i++) 

raCosMod [ j ++] = (real) cos { (i) * (2 * k + l)*Pi / 32); 
for (k=0 ;k<16 ;k++) 

raCosMod [ j ++] = real(0. 25/(2 * cos((2 * k + l)*Pi / 128))); 
for (k=0;k<16;k++) 

raCosMod[j+ + ] = real ( - . 25/ (2 * sin((2 * k + l)*Pi / 128))); 

} 

The filter bank reconstruction is illustrated by the following pseudocode: 

QMFInterpolation (FILTS, int nSUBS) { 
// Select filter 

if ( FILTS==0 ) // Non-perfect reconstruction 

prCoeff = raCoeff Lossy; 
else // Perfect reconstruction 

prCoeff = raCoef f LossLess ; 
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// Interpolation begins 

nChlndex =0; // Reconstructed channel sample index 

for (nSubIndex=nStart ; nSubIndex<nEnd; nSubIndex++) { // Subband samples 

// Load in one sample from each subband 
for (i=0; i<nSUBS; i++) 

raXin[i] = aSubband [i] . raSample [nSublndex] ; 
for (i=nSUBS; i<NumSubband; i++) // Clear inactive subbands 

raXin[i] = 0.0; 

//Multiply by cosine modulation coefficients and 
// Create temporary arrays SUM and DIFF. 
for (j=0,k=0;k<16;k++) { 

A[k] = (real) 0.0; 

for (i=0;i<16;i++) 

A [k] += (raXin [2*i] +raXin [2*i+l] ) *raCosMod [ j + + ] ; 

} 

for (k=0;k<16;k++) { 
B[k] = (real) 0.0; 
for (i=0 ; i<16 ; i++) { 
if(i>0) 

B [k] += (raXin [2*i] +raXin [2*i-l] ) *raCosMod [j+ + ] ; 

else 

B [k] += (raXin [2*i] ) *raCosMod [ j + + ] ; 

} 

SUM [k] =A[k] +B [k] ; 
DIFF [k] =A[k] -B [k] ; 

} 

// Store history 
for (k=0;k<16;k++) 

raX [k] =raCosMod [ j + + ] *SUM [k] ; 
for (k=0;k<16;k++) 

raX [32-k-l] =raCosMod [j+ + ] *DIFF [k] ; 

// Multiply by filter coefficients 
for (k=31, i=0;i<32;i++,k--) 
for (j=0;j<512; j+=64) 

raZ[i] += prCoeff [i + j] * (raX[i + j] -raX[j+k] ) ; 
for (k=31, i=0;i<32;i++,k--) 
for (j=0; j<512; j+=64) 

raZ[32 + i] += prCoef f [32 + i + j ] * ( -raX [i + j ] -raX [ j +k] ) ; 

// Create 32 PCM output samples 
for (i=0 ; i<32 ; i++) 

naCh [nChIndex++] = int (rScale*raZ [i] ) ; 

// Update working arrays 
for (i=511;i>=32;i--) 

raX [i] = raX [i-32] ; 
for (i=0 ; i<NumSubband; i++) 

raZ [i] = raZ [i + 32] ; 
for (i=0 ; i<NumSubband; i++) 

raZ[i+32] = (real) 0.0; 




C.2.6 Interpolation of LFE Channel 

void InterpolationFIR ( int nDecimationSelect ) { 

// rLFE : An array holding decimated samples. 
// Samples in current subframe starts from rLFE[0] , 
// while rLFE [ - 1] , rLFE [-2], stores samples 

// from last subframe as history. 

// naCh: An array holding interpolated samples 
// Select decimation filter 

if (nDecimationSelect==l ) {// 128 decimation 

nDeciFactor = 128; // Decimation factor = 128 

prCoeff = raCoeffl28; // Point to the 128X FIR coefficient array 

} 

else { // 64 decimation 
nDeciFactor = 64; 
prCoeff = raCoeff64; 

} 

// Interpolation 
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NumFIRCoef = 512; // Number of FIR coefficients 

nlnterplndex = 0; // Index to the interpolated samples 

for (nDeciIndex=0 ; nDeciIndex<nNumDeciSample ; nDeciIndex++ ) { 

// One decimated sample generates nDeciFactor interpolated ones, 
for (k=0; k<nDeciFactor ; k++) { 
// Clear accumulation 
rTmp = 0.0; 
// Accumulate 

for (J=0; J<NumFIRCoef /nDeciFactor ; J++) 

rTmp += rLFE [nDecilndex- J] *prCoef f [k+ J*nDeciFactor] ; 
// Save interpolated samples as integer 
naCh [nlnterplndex++] = (int)rTmp; 

} 

nDeciIndex++ ; // Next decimated sample 




C.3 Coefficients for Remapping Loudspeaker Locations 

The coefficients that control the remapping of loudspeaker locations are transmitted using the 5 bit codes, each 
corresponding to an index into a scale factor lookup table. 

The range of coefficients is between -15 dB and dB in steps of 0,5 dB. A subset of the Scale Factor Table (see 
clause D.l 1) is used to map the 5-bit codes to 16-bit fixed point values. The table entries are unsigned 16 bit integer 
numbers representing the numbers in column AbsValues of the same table, after multiplication by 2 15 and rounding to 
the nearest integer value. 

The coefficients (SpkrRemapCoeff) are obtained from the transmitted 5 bit codes (SpkrRemapCodes) in the following 
manner: 

Tbllndex = (nuSpkrRemapCodes << 2) + 120; 
if (Tbllndex > 240) { 

Error: Invalid Index For a Speaker Remapping Coefficient 

} 

SpkrRemapCoeff = LinScalesTable [Tbllndex] ; 



C.4 Post Mix Gain Adjustment 

The scale factors that adjust the gain in all channels after combining audio assets are transmitted using the 6 bit codes, 
each corresponding to an index into a scale factor lookup table. The range of scale factors is between -15 dB and 15 dB 
in steps of 0,5 dB. 

The Scale Factor Table (see clause D. 1 1) is the lookup table used to map the 6-bit codes to values available in the 
LinScalesTable column. The table entries are unsigned 16-bit integers representing the numbers in column AbsValues 
of the same table, after multiplication by 2 15 and rounding to the nearest integer value. 

The coefficient (PostMixGainAdj) is obtained from the transmitted 6-bit code (nuPostMixGainAdjCode) in the 
following manner: 

if (nuPostMixGainAdjCode > 60) { 

Error: Invalid Index For a Post Mix Gain Adjustment Code 

} 

if (nuPostMixGainAdjCode == 30) 

// dB gain adjustment 
PostMixGainAdj = 1 << 20; 

else 
{ 

// Look-up linear scale corresponding to range from -60 dB to -30 dB 
PostMixGainAdj = LinScalesTable [nuPostMixGainAdj Code] ; 

// Translate from range [-60 dB, -30 dB] to [-15 dB, +15 dB] 
// 5827066 (Q15) -> 177.8279 -> +45 dB 

// Also translate the PostMixGainAdj from 16-bit in Q15 to 24-bit in Q20 
PostMixGainAdj = (PostMixGainAdj *5827066 + (1 << 9)) >> 10; 

} 
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The resulting coefficient PostMixGainAdj is a 24-bit unsigned fixed-point number in Q20 format. 



C.5 Coefficients for Mixing Audio Assets 

The coefficients that control the mixing audio assets are transmitted using the 6-bit codes, each corresponding to an 
index into a scale factor lookup table. 

The range of scale factors is between -60 dB and dB with addition of -co (ScaleFactor = 0) which is coded with a code 
0. Furthermore the range [-60 to 0] is subdivided into 3 regions each with different grid resolution: 

1) [-60 to -30] with resolution of 2 dB 

2) [-29 to -15] with resolution of ldB 

3) [-14,5 to 0] with resolution of 0,5 dB 

A subset of the Scale Factor Table (see clause D.l 1) is used to correlate the index to the integer scalars. The table 
entries are unsigned 16 bit integers representing the numbers in column Abs Values of the same table, after 
multiplication by 2 15 and rounding to the nearest integer value. 

The coefficients (ScaleFactors) are obtained from the transmitted 6-bit codes (CoeffCodes) in the following manner: 

if (CoeffCodes == 0) 
ScaleFactors = 
else { 

ScaleFactorsTbllndex = (CoeffCodes - 1) << 2 ; 
if (ScaleFactorsTbllndex > 240) 

Error: Invalid Index For a Scale Factor 
ScaleFactors = LinScalesTable [ScaleFactorsTbllndex] ; 

} 



C.6 Smoothing the Coefficient Transitions 

The downmix coefficients, mixing and scaling coefficients may change their values from frame to frame. In order to 
ensure a smooth transition, the coefficient interpolation is performed over the frame that carries the new values for the 
coefficients. The interpolation is performed on all coefficients in a similar manner. In particular the coefficient linear 
interpolation is performed over the nFrmSize sample intervals. The value of actual coefficient Coeff(n) to be used at the 
time instance n (n = corresponds to the first sample in the current frame) is obtained using the procedure outlined 
below: 

Delta k = ScaleFactors k - ScaleFactors k _i ; 

nShift = log 2 (nFrmSize) ; // nFrmSize = 2 nShlft 
if ( |Delta k | > ) { 

Ramp = 0; // Ramp needs to be 32 -bit variable 
for (n=0; n<nFrmSize; n++) { 

Coef f [n+ + ] = ScaleFactors k _i + (Ramp + ( 1<< (nShif t- 1 ) ) ) >>nShif t ; 
Ramp += Delta k ; 

} 

} 

else{ 

for (n=0; n<nFrmSize; n++) 

Coef f [n] = ScaleFactors k ; 

} 

where the » indicates the right shift operation, the « indicates the left shift operation and the ScaleFactors k are the 
coefficients transmitted in the stream in the frame k. 

The resulting values of Coeff(n) are in the same fixed-point number format as the original ScaleFactors k . The new 
coefficient value (ScaleFactors k ) is reached over the period that corresponds to nFrmSize samples. 
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C.7 Entropy Coding 

Entropy coding removes redundancy from the residual signal e(n) without any loss of information. In DTS-HD lossless 
codec, two types of codes-Rice codes and Binary codes-are used. Both codes are simple to implement on the encoder 
and the decoder side. Binary codes are optimal codes for a uniform distribution of numbers that are to be coded. On the 
other hand, the Rice codes are Huffman codes for a Laplacian distribution of numbers that need to be coded. As it turns 
out, the distribution of the most of the residual signals obtained after linear prediction are close to the Laplacian. Highly 
uncorrected input audio signals are an exception; their residual distribution is closer to uniform. 

Binary code is characterized by a single parameter BitWidth that represents the number of bits used for each code word. 
The code word is just a binary representation of unsigned integer numbers in the range between and 

2BitWidth . j 

The Rice codes are also characterized by a single parameter, denoted as KRice. The Rice code with parameter KRice for 
an unsigned integer valued residual e(n) is constructed from two parts: 

1) A unary representation of {u(n)»KRice } where » denotes a right shift operation and a unary representation 
of number k consisting of k "0" bits followed by a single stop bit "1" (i.e. unary representation of 5 is 000001). 

2) KRice least significant bits of e(n), i.e. KRice bit binary representation of{u(n) & (1« KRice -1)} where « 
denotes a left shift operation and & denotes a bitwise AND operation. 

The above definitions for both Binary and Rice codes assume coding of the unsigned integer numbers. Audio samples, 
as well as the prediction residual samples (both denoted by e(n)) in our codec, are signed integers and prior to the 
entropy coding, these samples need to be translated to the unsigned integer representation u(n) using the following 
mapping: 




e(n) « 1, Ve(n) > 
((-e(n)) «!)-!, Ve(«)<0 



where « denotes the left shift operation. 

The coding parameter for both types of codes is determined for the duration of one segment. The choice of the code 
(Binary or Rice) is also determined for one segment. 

In a channel set, the code selection (bRiceCodeFlag) and the corresponding coding parameter (ncABIT) can be chosen: 

1) For each channel separately (ncSegType = 0) i.e. all samples in the segment of channel 1 are coded using 
(bRiceCodeFlag 1, ncABITl), all samples in the segment of channel 2 are coded using (bRiceCodeFlag2, 
ncABIT2). 

2) For all channels jointly (ncSegType =1) i.e. all samples in the segment for all channels in the channel set are 
coded using single bRiceCodeFlag and single ncABIT. 

As indicated above, the selection between 1) and 2) is carried in the parameter ncSegType. 

For one channel set in one segment, the following components are packed: 

1) ncSegType 

2) bRiceCodeFlag [] (1 per channel when ncSegType=0 or single for all channels when ncSegType =1) 

3) ncABIT[] (1 per channel when ncSegType=0 or single for all channels when ncSegType =1) 

4) the entropy codes for all samples in one segment of channel 1 

5) the entropy codes for all samples in one segment of channel 2 

6) the entropy codes for all samples in one segment of channel 3 

On the decode side the entropy codes are extracted, as unsigned numbers u(n), according to the previously extracted 
coding parameters (ncSegType, bRiceCodeFlag[] and ncABIT[]). 
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To reconstruct signed integer representation of residual samples e(n) from the extracted unsigned integer numbers u(n), 
the following mapping is performed: 

\u(n) » 1, V even numbered u(n) 
e(n) = < 

[— (u(n) » 1) — 1, V odd numbered u(n) 



C.8 Downmix Coefficients 

The remainder of this text defines the following parameters: 

1) DmixCoeff as the fixed-point representation for entries of matrix Dmix_Mtrx 

2) DmixScale as the fixed-point representation for entries of matrix Scale_Mtrx 

3) InvDmixScale as the fixed-point representation for entries of matrix InvScale_Mtrx 

Both the DmixCoeff and DmixScale parameters are transmitted using 9-bit codes DmixCoeffCodes and 
DmixScaleCodes respectively. Each consists of a sign and an index for the table lookup of its absolute value 
IDmixCoeffl or IDmixScalel. 

Since both the DmixScale and the InvDmixScale are needed on the decode side, there are two look-up tables, as defined 
in clause D.ll: 

• DmixTable[ ] is used for finding the absolute value of both DmixCoeff and DmixScale parameters 

• InvDmixTable[ ] is used for finding the absolute value of InvDmixScale parameter 

Note that the range of the entries in DmixTable[ ] is between -60 dB and dB with addition of -co (IDMixCoeffl =0), 
which is coded with a DmixCode=0. Furthermore, the range [-60 to 0] is subdivided into 3 regions, each with a different 
grid resolution: 

1) [-60 to -30] with resolution of 0,5 dB 

2) [-29,75 to -15] with resolution of 0,25 dB 

3) [-14,875 to 0] with resolution of 0,125 dB 

Although the DmixScale parameters are obtained from the DmixTable[ ], their range is limited, on the encode side, to 
[-40 dB, dB]. Consequently the InvDmixTable[ ], the look-up table for the InvDmixScale parameters, has entries that 
correspond to the inverse of IDmixScalel over the limited range [-40 dB to dB]. 

The entries in DmixTable column of clause D.l 1 are unsigned 16-bit integer numbers representing the numbers in 
column AbsValues of clause D.l 1, after multiplication by 2 15 and rounding to the nearest integer value. The 
DMixCoeff are signed integer numbers obtained from the entries of DmixTable after multiplication by the sign value 
DmixSign. 

The DmixCoeff parameters are obtained from the transmitted 9-bit codes DmixCoeffCodes in the following manner: 

1) Extract Sign Bit: the most significant bit represents the sign bit such that: 

(DmixCoeffCodes and 0xl00)»8 = 1 -> DmixSign = 1 
(DmixCoeffCodes and 0xl00)»8 = -> DmixSign = -1 

2) Look up the IDmixCoeffl: the lower 8 bits of the DmixCoeffCodes represent the index into the DmixTable []; 
the DMixCoeff parameters are calculated in the following manner: 

if ((DmixCoeffCodes & OxFF) == 0) 

DMixCoeff = 0; 
else { 

DmixTbllndex = (DmixCoeffCodes & OxFF) - 1; 
if (DmixTbllndex > 24 0) 

Error: Invalid Index for a Downmix Coefficient 

DMixCoeff = DmixSign *DmixTable [DmixTbllndex] ; 
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} 

The DmixScale parameters are obtained from the transmitted 9-bit codes DmixScaleCodes in the following manner: 

1) 1) Extract Sign Bit: the most significant bit represents the sign bit such that: 

(DmixScaleCodes and 0xl00)»8 = 1 -> DmixScaleSign = 1 
(DmixScaleCodes and 0xl00)»8 = -> DmixScaleSign = -1 

2) Look up the IDmixScalel: the lower 8 bits of the DmixScaleCodes represent the index into the DmixTable[ ]; 
a) the DmixScale parameters are calculated in the following manner: 

DmixScaleTbllndex = (DmixScaleCodes & OxFF) - 1 
If {(DmixScaleTbllndex < 40) || (DmixScaleTbllndex >240) ) 
Error: Invalid Index for a Downmix Scaling Parameter 
DmixScale = DmixScaleSign x DmixTable [DmixScaleTbllndex] ; 

The entries in column InvDmixTable of clause D.l 1 are unsigned 24-bit integer numbers representing the numbers from 
column InvAbs Values of clause D.l 1, after multiplication by 2 16 and rounding to the nearest integer value. The 
InvDmixScale are signed 24-bit integer numbers obtained from the entries of InvDmixTable [ ] after multiplication by 
the sign value DmixScaleSign. 

The InvDmixScale parameters are obtained from already calculated DmixScaleSign and DmixScaleTbllndex 
parameters as follows: 

InvDmixScale = DmixScaleSign * InvDmixTable [DmixScaleTblIndex-40] ; 



The decoder does not use DmixCoeff and InvDmixScale separately. It uses their product instead. The resulting 
UndoDmixScale parameter is calculated using fixed-point arithmetic as follows: 

UndoDmixScale = ( InvDmixScale * DmixCoeff + (1<<15) ) >>16 

where « and » denote left and right integer shift operators. The UndoDmixScale is a 24-bit signed integer 
represented in Q15 fixed point representation (i.e. all its values are scaled by 2 15 ). 
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Annex D (normative): 
Large Tables 

D.1 Scale Factor Quantization Tables 



D.1 .1 6-bit Quantization (Nominal 2,2 dB Step) 



Index 


Quantization level 


Quantization level 
in dB 





1 


0,0 


1 


2 


6,0 


2 


2 


6,0 


3 


3 


9,5 


4 


3 


9,5 


5 


4 


12,0 


6 


6 


15,5 


7 


7 


17,0 


8 


10 


20,0 


9 


12 


21,5 


10 


16 


24,0 


11 


20 


26,0 


12 


26 


28,3 


13 


34 


30,6 


14 


44 


32,8 


15 


56 


35,0 


16 


72 


37,2 


17 


93 


39,4 


18 


120 


41,6 


19 


155 


43,8 


20 


200 


46,0 


21 


257 


48,2 


22 


331 


50,4 


23 


427 


52,6 


24 


550 


54,8 


25 


708 


57,0 


26 


912 


59,2 


27 


1175 


61,4 


28 


1514 


63,6 


29 


1950 


65,8 


30 


2512 


68,0 


31 


3236 


70,2 



Index 


Quantization level 


Quantization level 
in dB 


32 


4169 


72,4 


33 


5370 


74,6 


34 


6918 


76,8 


35 


8913 


79,0 


36 


11482 


81,2 


37 


14791 


83,4 


38 


19055 


85,6 


39 


24547 


87,8 


40 


31623 


90,0 


41 


40738 


92,2 


42 


52481 


94,4 


43 


67608 


96,6 


44 


87096 


98,8 


45 


112202 


101,0 


46 


144544 


103,2 


47 


186209 


105,4 


48 


239883 


107,6 


49 


309030 


109,8 


50 J 


398107 


112,0 


51 


512861 


114,2 


52 


660693 


116,4 


53 


851138 


118,6 


54 


1 096478 


120,8 


55 


1412538 


123,0 


56 


1819701 


125,2 


57 


2344229 


127,4 


58 


3019952 


129,6 


59 


3890451 
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63 
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invalid 



D.1 .2 7-bit Quantization (Nominal 1 ,1 dB Step) 



Index 


Quantization level 


Quantization Level (dB) 





1 


0,0 


1 


1 


0,0 


2 


2 


6,0 


3 
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6,0 
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6,0 
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2 


6,0 
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7 


3 
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3 


9,5 
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12,0 


10 


4 


12,0 



Index 


Quantization level 


Quantization Level (dB) 


11 


5 


14,0 


12 


6 


15,5 


13 


7 j 


17,0 


14 


7 


17,0 


15 


8 


18,0 


16 


10 


20,0 


17 


11 


21,0 


18 


12 


21,5 


19 


14 


23,0 


20 


16 


24,0 


21 


18 


25,1 
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D.2 Quantization Step Size 
D.2.1 Lossy Quantization 



ABITS Index 


Step-sizex2" 


Nominal Step-size 








0,0 


1 


6710886 


1,6 


2 


4194304 


1,0 


3 


3355443 


0,8 


4 


2474639 


0,59 


5 


2097152 


0,50 


6 


1761608 


0,42 


7 


1426063 


0,34 


8 


796918 


0,19 


9 


461373 


0,11 


10 


251658 


0,06 


11 


146801 


0,035 


12 


79692 


0,019 


13 


46137 


0,011 


14 


27263 


0,0065 


15 


16777 


0,0040 


16 


10486 


0,0025 


17 


5872 


0,0014 


18 


3355 


0,0008 


19 


1887 


0,00045 


20 


1258 


0,00030 


21 


713 


0,00017 


22 


336 


0,00008 


23 


168 


0,00004 


24 


84 


0,00002 


25 


42 


0,00001 


26 


21 


0,000005 


27 


invalid 


invalid 


28 


invalid 


invalid 


29 


invalid 


invalid 


30 


invalid 


invalid 


31 


invalid 


invalid 
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ABITS Index 


Step-sizex2^ 


Nominal Step-size 








0,0 


1 


4194304 


1,0 


2 


2097152 


0,5 


3 


1384120 


0,33 


4 


1 048576 


0,25 


5 


696254 


0,166 


6 


524288 


0,125 


7 


348127 


0,083 


8 


262144 


0,0625 


9 


131072 


0,03125 


10 


65431 


0,0156 


11 


33026 


7,874e-3 


12 


16450 


3,922e-3 


13 


8208 


1 ,957e-3 


14 


4100 


9,775e-4 


15 


2049 


4,885e-4 


16 


1024 


2,442e-4 


17 


512 


1,221e-4 


18 


256 


6,104e-5 


19 


128 


3,052e-5 


20 


64 


1 ,526e-5 


21 


32 


7,629e-6 


22 


16 


3,81 5e-6 


23 


8 


1 ,907e-6 


24 


4 


9,537e-7 


25 


2 


4,768e-7 


26 


1 


2,384e-7 


27 


invalid 


invalid 


28 


invalid 


invalid 


29 


invalid 


invalid 


30 


invalid 


invalid 


31 


invalid 


invalid 
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D.3 Scale Factor for Joint Intensity Coding 



Index 


Scale Factor 





A AOCAOO 

0,025088 


H 
1 


a aoaao>i 

0,0^ob24 


2 


0,0^olo 


3 


A AO AO O /I 

0,029824 


4 


0,031 61 fo 


5 


0,033472 


a 
6 


A AO C /I C A 

0,035456 


7 


A AO"7CAO 

0,037568 


o 



A AOAOAO 

0,039808 


A 


0,0421 76 


A A 

10 


A A A A A"70 

0,044672 


A -{ 

1 1 


A A A TOAA 

0,047296 


1 2 


a ac a h h o 

0,0501 12 


a o 


0,05312 


H A 

1 4 


A ACAOCA 

0,056256 


15 


A AC ACO yl 

0,059584 


^ a 
16 


0,063104 


1 7 


A A AAO H A 

0,066816 


18 


A A"7A"70>1 

0,070784 


H A 

19 


A A"7CAAO 

0,075008 


OA 

20 


A A"7A A O A 

0,079424 


o h 

21 


0,08416 


22 


A nofnco 

0,089152 


oo 


A AQyl A 

0,U944 


24 


0,099968 


25 


0,10592 


26 


0,112192 


27 


0,118848 


28 


0,125888 


29 


0,133376 


30 


0,141248 


31 


0,149632 



Index 


Scale Factor 


32 


A A C O A A A 

0,1 58464 


oo 

33 


A H A "70 ~7 O 

0,1 o7o72 


O A 

34 


A H 770Cr 

0,1 77856 


oc 

35 


0,1 88352 


36 


A H A A£T £T O 

0,1 99552 


0"7 

37 


A OH H O OO 

0,21 1 328 


oo 

38 


A 0000"70 

0,223872 


OA 

39 


A 1074 O 

0,2371 2 


A A 

40 


A OC H O 

0,2512 


A A 

41 


A O A AA A O 

0,266048 


A O 

42 


A OO H OCA 

0,281856 


A O 

43 


A OAOCA 

0,29856 


A A 

44 


A O A AOO yl 

0,31 6224 


A C 

45 


A OO A A"7A 

0,334976 


46 


A O C /I O H A 

0,354816 


A ~7 

47 


A 17COnO 

0,375808 


A O 

48 


A OAOAO 

0,39808 


A A 

49 


A A O A A A A 

0,421696 


C A 

50 


A A A AACA 

0,446656 


c ^ 

51 


A A "70 1C1 

0,4731 52 


CO 

52 


A C A H H O /I 

0,501 184 


CO 

53 


A COAOO 

0,53088 


C A 

54 


A C AOO A O 

0,562368 


55 


A C AC A/1 Q 

0,595648 


56 


0,630976 


57 


0,668352 


58 


0,707968 


59 


0,749888 


60 


0,794304 


61 


0,841408 


62 


0,891264 


63 


0,944064 



Index 


Scale Factor 


64 


1 


AC 

65 


A ACAOA 

1 ,05926 


A A 

66 


A A OOAC 

1 ,1 2205 


A"7 

67 


H 4 00/10 

1 ,1 8848 


68 


1 ,25894 


69 


1 ,3335 


"7A 

70 


A A A OC A 

1 ,41 254 


—J A 

71 


A ylAAOA 

1 ,49626 


"70 

72 


A CO ACi 

1 ,5849 


"70 

73 


A 07070 

1 ,67878 


74 


A 7700 

1 ,7783 


"7C 

75 


A OOOAC 

1 ,88365 


"7A 

76 


A AACOA 

1 ,99526 


-7-7 

77 


O A A O A ~7 

2,1 1347 


78 


O 00070 

2,23872 


"7A 

79 


O 0710fl 

2,37139 


OA 

80 


O C^07 

2,51 1 87 


O A 

81 


O AAA"7/I 

2,66074 


00 
82 


O O A 007 

2,81837 


83 


O AO C A H 

2,98541 


O A 

84 


O A AOO 

3,1623 


oc 

85 


O O A AAO 

3,34963 


OA 

86 


O C A O H A 

3,5481 6 


til 


3, /584 


88 


3,98106 


89 


4,21696 


90 


4,46682 


91 


4,73152 


92 


5,0119 


93 


5,30886 


94 


5,62342 


95 


5,95661 



Index 


Scale Factor 


96 


6,30957 


A"7 

97 


A AOO A A 


AO 

98 


7 A "7 A <1 A 

7,07949 


99 


7 /I A OA >1 

7,49894 


1 00 


7,9433 


HAH 
1 01 


O A H O A CT 

8,41 395 


A AO 

1 02 


O A H OC H 

8,91 251 


A AO 

1 03 


A /I /I AA yt 

9,44064 


A A A 

1 04 


A A 

10 


A AC 

1 05 


H A CAOC 

10,5925 


A AA 

1 06 


A A OOAO 

1 1 ,2202 


A A"7 

1 07 


A A O OC 

1 1 ,885 


1 08 


HOC OAO 

12,5892 


A AA 

1 09 


A O OOCO 

13,3352 


1 1 


A A A OC yl 

1 4,1 254 


AAA 
111 


H yl A AO yt 

1 4,9624 


A A O 

112 


H C O /I A 

15,849 


H A O 

1 1 3 


H A "TOO 

16,788 


AAA 

114 


A ~7 "7000 

1 7,7828 


A A C 

1 1 5 


HO O O A £T 

18,8365 


H H A 

1 1 6 


H A ACOA 

19,9526 


A A ~7 

117 


OH A O A A 

21 ,1 349 


AAO 

1 1 8 


OO 00"70 

22,3872 


1 1 9 


23,7137 


1 90, 


oc h h 00 
*lO, I I 00 


121 


26,6072 


122 


28,1838 


123 


29,8538 


124 


31,6228 


125 


33,4965 


126 


35,4813 


127 


37,5837 


128 


39,8107 



D.4 Dynamic Range Control 



Index 


Q18 binary 


Multiplier 


Log Multiplier (dB) 





0,00040394 


0,0259 


-31,7500 


1 


0,00041574 


0,0266 


-31,5000 


2 


0,00042788 


0,0274 


-31,2500 


3 


0,00044037 


0,0282 


-31,0000 


4 


0,00045323 


0,0290 


-30,7500 


5 


0,00046647 


0,0299 


-30,5000 


6 


0,00048009 


0,0307 


-30,2500 


7 


0,00049411 


0,0316 


-30,0000 


8 


0,00050853 


0,0325 


-29,7500 


9 


0,00052338 


0,0335 


-29,5000 


10 


0,00053867 


0,0345 


-29,2500 


11 


0,00055440 


0,0355 


-29,0000 


12 


0,00057058 


0,0365 


-28,7500 


13 


0,00058725 


0,0376 


-28,5000 


14 


0,00060439 


0,0387 


-28,2500 


15 


0,00062204 


0,0398 


-28,0000 



Index 


Q18 binary 


Multiplier 


Log Multiplier (dB) 


16 


0,00064021 


0,0410 


-27,7500 


17 


0,00065890 


0,0422 


-27,5000 


18 


0,00067814 


0,0434 


-27,2500 


19 


0,00069794 


0,0447 


-27,0000 


20 


0,00071832 


0,0460 


-26,7500 


21 


0,00073930 


0,0473 


-26,5000 


22 


0,00076089 


0,0487 


-26,2500 


23 


0,00078311 


0,0501 


-26,0000 


24 


0,00080597 


0,0516 


-25,7500 


25 


0,00082951 


0,0531 


-25,5000 


26 


0,00085373 


0,0546 


-25,2500 


27 


0,00087866 


0,0562 


-25,0000 


28 


0,00090432 


0,0579 


-24,7500 


29 


0,00093072 


0,0596 


-24,5000 


30 


0,00095790 


0,0613 


-24,2500 


31 


0,00098587 


0,0631 


-24,0000 
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/ , / 1 /y 


1 7 7CAA 
1 / , / OUU 


1 QQ 

i yy 


A 1 O/l 1 1 07Q 

u,i £i4i i o/y 


7 Q/1 OO 

/,y4oo 


1 Q AAAA 

1 o,UUUU 


dUU 


A 1 07707Q7 

u,i £i / /o/y/ 


Q 1 7CO 
0,1 lOd 


1 Q OCAA 
1 O,£l0UU 


oai 


A 1 Q1 /I C7QQ 

u,i oi 4b/yy 


Q A 1 /I A 
o,41 4U 


1 Q CAAA 
1 O,0UUU 


oc\o 
dUd 


A 1 QCQACQQ 

U,l oOoUbyo 


Q CCQC 

o,boyb 


1 Q 7CAA 
1 O, / OUU 


oao 


A 1 QQOC7QC 

u,i oy^o/yb 


Q Q1 OC 

o,y i £io 


1 Q AAAA 

i y,uuuu 


OC\A 

dun- 


A 1 A QQO/1 OC 

U,l 4oo*i4ob 


Q 1 70Q 

y, i / do 


1 Q OCAA 

i y,*iouu 


**_uo 


a 1/17caqci 
u, i *+/ ouyo i 


Q /l/IAfi 


1 Q CAAA 

i y,ouuu 


206 


0,15181687 


9,7163 


19,7500 


207 


0,15625000 


10,0000 


20,0000 


208 


0,16081258 


10,2920 


20,2500 


209 


0,16550839 


10,5925 


20,5000 


210 


0,17034133 


10,9018 


20,7500 


211 


0,17531538 


11,2202 


21,0000 



InHov 

II lUCA 


01 R hinarx/ 
Vx I O Ullldi y 


l\/li iltinlioi* 
IVIUI 11 unci 


1 rtn Mi iltinlioi* /HR^ 


01 o 
d\d 


A 1 QA/1 O/l CQ 

U,l oU4o4by 


1 1 C/1 7Q 

1 1 ,04/0 


01 OCAA 
d \ ,dO\J\J 


01 o 
d\ o 


A 1 QC7AO/1 7 

U,l O0/Uo4/ 


I 1 QQCA 

I I ,ooOU 


01 CAAA 

d \ ,ouuu 


01 A 


A 1 Q1 1 OC1 1 

u,i yi i do\ i 


1 O 0001 

1 d,dod\ 


01 7CAA 

d \ , / OUU 


01 C 

d\ 


A 1 QC7A71 A 

u,i yb/u/ 1 u 


1 O CQQO 


OO AAAA 


01 c 
d\ b 


A OC\OA C1 AC 
U,£lU£l401 UO 


1 O QCCQ 

i ^,yoby 


OO OCAA 
dd,dO\J\J 


01 7 

d \ / 


A 0AQ0C070 

\J,d\JooOd/ d 


1 o ooco 
1 o,oo0^ 


OO CAAA 
£l^,0UUU 


01 Q 
d\ 


A O H A A A 7 AO 

U,£ll 444/Uo 


1 O ~70A C 
1 O, / dQ-O 


OO 7CAA 

dd, /OUU 


01 Q 

d\ y 


A OOA7AQQQ 

u,£i£iu/uoyy 


1/11 OC/1 

14,1 £i04 


OO AAAA 
riO,UUUU 


OOA 

ddU 


A 0071 COQ1 

\J,ddf 1 OoOl 


1 /I C07Q 

1 4, Do/0 


OO OCAA 


001 


A 0007QCQ0 

U,£ioo/0b0£i 


1 /I OPOA 

1 4,yb^4 


OO CAAA 
riO,0UUU 


ooo 
ddd 


A OA AC1 OCO 

U,£i4Ubl oOd 


1 C OQQO 

i o,oyyo 


OO 7CAA 

do, 1 OUU 


ddo 


A 0/17C0QCC 

U,£i4/boy0b 


1 C Q/l QQ 

i o,o4oy 


OA AAAA 

£i4,UUUU 


OOA 

ddQ 


A OC/1Q7A77 

U,^i040/U/ / 


1 C 01 1 7 

1 b,ol 1 / 


OA OCAA 
r^4,ilOUU 


ooc 
ddo 


A OCOO 1 O 1 O 

\J,dodo\ ol o 


1 C 7QQA 
1 b, /oOU 


OA CAAA 

£i4,0UUU 


OOP. 

ddo 


A 0CQQ70Q1 

u,^byy z^ioi 


1 7 07Q0 

1 / ,d/oo 


OA 7CAA 

£i4, / OUU 


OOI 

ddl 


A 077QCC1 C 

u,£i / /oobi b 


1 7 7QOQ 

1 / , /o<r:0 


OC AAAA 
riO,UUUU 


OOQ 

ddo 


A OQCQCQ7A 

u,£iooyby /u 


1 Q OAOl 
1 0,oU*il 


OC OCAA 


OOQ 

dd\5 


A OQ/1 OOA1 7 

u,£iy4o*iui / 


1 Q QOCC 

1 0,oob0 


OC CAAA 

(io,ouuu 


OOA 


A OAOQ1 A A 7 

U,oU£iy 1 44/ 


1 Q OQCC 

i y,oobo 


OC 7CAA 

do, I OUU 


OO 1 

*iol 


A 01 1 7CQ7/1 

u,oi i / oy /4 


1 Q QCOC 

i y,yo*ib 


OC AAAA 

£ib,uuuu 


ooo 
dod 


A OOAQCOOQ 

U,O£iU0bO£:y 


OA COCO 

£iU,OoOo 


OC OCAA 

£:b,£iOUU 


ooo 
doo 


A OOAOOOCC 

U,ooU*io*ibb 


01 1 O/l Q 

d\ ,1 o4y 


OC CAAA 

<ib,ouuu 


OO.A 


A OOQQ7CCO 

u,ooyo/obo 


01 7COA 
d\ , fOd\J 


OC 7CAA 

do, I OUU 


ooc 

doO 


A O/l QQAA1 Q 

u,o4youui o 


OO 0Q70 
dd,OOf d 


07 AAAA 

d /,uuuu 


ooc 
*iob 


A OCAA1 A CO 

U,obUUl 40o 


OO A/1AQ 

^o,u4uy 


07 OCAA 

d I ,do\}\} 


007 

do/ 


A 07AC071 A 

U,o /UDdf 1 4 


OO 71 07 

£lO, / 1 o/ 


07 CAAA 
£l/,0UUU 


OOQ 

*io0 


A OQ1 O/l C70 

U,ool o4b/o 


OA /1ACO 

^4,4Ub^i 


07 7CAA 
dl ,1 OUU 


OOQ 


A OQO/1 QOOC 
U,oy<l40£l*l0 


OC 1 1 QQ 

do,\ i oy 


OQ AAAA 


OA A 


A A AOQ/1 OQ/1 

u,4Uoy4iiy4 


OC QCOO 

£i0,o0/-:o 


OQ OCAA 
do,dO\}\J 


OA 1 


A A 1 C70QOQ 

u,4i o/oo^y 


OC CA70 

<ib,bU/o 


OQ CAAA 
£O,0UUU 


OA O 
dQd 


A /107Q7QA7 

U,4£i /0/OU/ 


07 OQ/1 O 

^/,oo4ii 


OQ 7CAA 
do, /OUU 


OA O 


A /1/1A07000 

U,44Uo/iioo 


OQ 1 QOQ 


OQ AAAA 

^y,uuuu 


OA A 
£l44 


A A COOO 1/1/1 

U,40o£iol 44 


OQ AACQ 

*iy,uubo 


OQ OCAA 
d\d,dO\}\J 


OA C 


A /1CC/1CCAO 

U,4bb4bbUo 


OQ QCOQ 

<-:y,oDoO 


OQ CAAA 

^y,ouuu 


OA C 
£l4b 


A /I QAAQ7AQ 

u,4ouuo/uy 


OA 70CC 

oU, / doo 


OQ 7CAA 

d\d, I OUU 


OA 7 
^4/ 


A A OA 1 ACQQ 

U,4y41 UOOo 


01 COOQ 

ol ,oddo 


OA AAAA 

oU,UUUU 


OA Q 

*i40 


A CAQCO/1A/1 

U,0Uo0o4U4 


OO C/ICO 

od,o^od 


OA OCAA 




A COOOQOCA 
U,0£_OOOOOU 


oq AQCC 

00,*+c?00 


OA CAAA 
OU,OUUU 


250 


0,53866657 


34,4747 


30,7500 


251 


0,55439592 


35,4813 


31,0000 


252 


0,57058457 


36,5174 


31,2500 


253 


0,58724594 


37,5837 


31,5000 


254 


0,60439384 


38,6812 


31,7500 


255 


0,62204245 


39,8107 


32,0000 
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D.5 Huffman Code Books 
D.5.1 3 Levels 



Table A3 



Quantization level 


Code length 


Code 





1 





1 


2 


2 


-1 


2 


3 



D.5.2 4 Levels (For TMODE) 



Table A4 Table B4 



Quantization level 


Code length 


Code 




Quantization level 


Code length 


Code 





1 










2 


2 


1 


2 


2 




1 


3 


6 


2 


3 


6 




2 


3 


7 


3 


3 


7 




3 


1 





Table C4 




Table D4 




Quantization level 


Code length 


Code 




Quantization level 


Code length 


Code 





3 


6 







2 





1 


3 


7 




1 


2 


1 


2 


1 







2 


2 


2 


3 


2 


2 




3 


2 


3 



D.5.3 5 Levels 



Table A5 Table B5 Table C5 



Quantization 


Code 


Code 




Quantization 


Code 


Code 




Quantization 


Code 


Code 


level 


length 






level 


length 






level 


length 







1 










2 


2 







1 





1 


2 


2 




1 


2 







1 


3 


4 


-1 


3 


6 




-1 


2 


1 




-1 


3 


5 


2 


4 


14 




2 


3 


6 




2 


3 


6 


-2 


4 


15 




-2 


3 


7 




-2 


3 


7 
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D.5.4 7 Levels 



Table A7 



Table B7 



Table C7 



Quantization 


Code 


Code 


level 


length 







1 





1 


3 


6 


-1 


3 


5 


2 


3 


4 


-2 


4 


14 


3 


5 


31 


-3 


5 


30 



Quantization 


Code 


Code 


level 


length 







2 


3 


1 


2 


1 


-1 


2 





2 


3 


4 


-2 


4 


11 


3 


5 


21 


-3 


5 


20 



Quantization 


Code 


Code 


level 


length 







2 


3 


1 


2 


2 


-1 


2 


1 


2 


4 


3 


-2 


4 


2 


3 


4 


1 


-3 


4 






D.5.5 9 Levels 



Table A9 



Table B9 



Table C9 



Quantization 


Code 


Code 


level 


length 







1 





1 


3 


7 


-1 


3 


5 


2 


4 


13 


-2 


4 


9 


3 


4 


8 


-3 


5 


25 


4 


6 


49 


-4 


6 


48 



Quantization 


Code 


Code 


level 


length 







2 


2 


1 


2 





-1 


3 


7 


2 


3 


3 


-2 


3 


2 


3 


5 


27 


-3 


5 


26 


4 


5 


25 


-4 


5 


24 



Quantization 


Code 


Code 


level 


length 







2 


2 


1 


2 





-1 


3 


7 


2 


3 


6 


-2 


3 


2 


3 


4 


6 


-3 


5 


15 


4 


6 


29 


-4 


6 


28 



D.5.6 12 Levels (for BHUFF) 



Table A12 



Table B12 



Table C12 



ABITS 


Code length 


Code 


1 


1 





2 


2 


2 


3 


3 


6 


4 


4 


14 


5 


5 


30 


6 


6 


62 


7 


8 


255 


8 


8 


254 


9 


9 


507 


10 


9 


506 


11 


9 


505 


12 


9 


504 



ABITS 


Code length 


Code 


1 


1 


1 


2 


2 





3 


3 


2 


4 


5 


15 


5 


5 


12 


6 


6 


29 


7 


7 


57 


8 


7 


56 


9 


7 


55 


10 


7 


54 


11 


7 


53 


12 


7 


52 



ABITS 


Code length 


Code 


1 


2 





2 


3 


7 


3 


3 


5 


4 


3 


4 


5 


3 


2 


6 


4 


13 


7 


4 


12 


8 


4 


6 


9 


5 


15 


10 


6 


29 


11 


7 


57 


12 


7 


56 
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Table D12 



ABITS 


Code length 


Code 


1 


2 


3 


2 


2 


2 


3 


2 





4 


3 


2 


5 


4 


6 


6 


5 


14 


7 


6 


30 


8 


7 


62 


9 


8 


126 


10 


9 


254 


11 


10 


511 


12 


10 


510 



Table E12 



ABITS 


Codp Ipnoth 


Code 


i 


I 


i 


p 


O 

c. 


n 


n, 
o 


r> 
o 


p 


4 


4 


6 


5 


5 


14 


6 


7 


63 


7 


7 


61 


8 


8 


124 


9 


8 


121 


10 


8 


120 


11 


9 


251 


12 


9 


250 



D.5.7 13 Levels 



Table A13 



Table B13 



Table C13 



Quantization 


Code 


Code 


level 


length 







1 





1 


3 


4 


-1 


4 


15 


2 


4 


13 


-2 


4 


12 


3 


4 


10 


-3 


5 


29 


4 


5 


22 


-4 


6 


57 


5 


6 


47 


-5 


6 


46 


6 


7 


113 


-6 


7 


112 



Quantization 


Code 


Code 


level 


length 







2 





1 


3 


6 


-1 


3 


5 


2 


3 


2 


-2 


4 


15 


3 


4 


9 


-3 


4 


7 


4 


4 


6 


-4 


5 


29 


5 


5 


17 


-5 


5 


16 


6 


6 


57 


-6 


6 


56 



Quantization 


Code 


Code 


level 


length 







3 


5 


1 


3 


4 


-1 


3 


3 


2 


3 


2 


-2 


3 





3 


4 


15 


-3 


4 


14 


4 


4 


12 


-4 


4 


3 


5 


5 


27 


-5 


5 


26 


6 


5 


5 


-6 


5 


4 



D.5.8 17 Levels 



Table A17 



Table B17 



Quantization level 


Code length 


Code 





2 


1 


1 


3 


7 


-1 


3 


6 


2 


3 


4 


-2 


3 


1 


3 


4 


11 


-3 


4 


10 


4 


4 





-4 


5 


3 


5 


6 


4 


-5 


7 


11 


6 


8 


20 


-6 


9 


43 


7 


10 


84 


-7 


11 


171 


8 


12 


341 


-8 


12 


340 



Quantization level 


Code length 


Code 





2 





1 


3 


6 


-1 


3 


5 


2 


3 


2 


-2 


4 


15 


3 


4 


9 


-3 


4 


8 


4 


5 


29 


-4 


5 


28 


5 


5 


14 


-5 


5 


13 


6 


6 


30 


-6 


6 


25 


7 


6 


24 


-7 


7 


63 


8 


8 


125 


-8 


8 


124 
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Tabled 7 



Table D17 



Quantization level 


Code length 


Code 







3 


6 




1 


3 


4 




-1 


3 


3 




2 


3 







-2 


4 


15 




3 


4 


1 1 




-3 


4 


10 




4 


4 


4 




-4 


4 


3 




5 


5 


29 




-5 


5 


28 




6 


5 


10 




-6 


5 


5 




7 


5 


4 




-7 


6 


23 




8 


7 


45 




-8 


7 


44 




Table E17 








Quantization level 


Code length 


Code 





1 





1 


3 


5 


-1 


3 


4 


2 


4 


12 


-2 


5 


31 


3 


5 


28 


-3 


5 


27 


4 


6 


60 


-4 


6 


59 


5 


6 


53 


-5 


6 


52 


6 


7 


122 


-6 


7 


117 


7 


8 


247 


-7 


8 


246 


8 


8 


233 


-8 


8 


232 



Quantization level 


Code length 


Code 





1 





1 


3 


7 


-1 


3 


6 


2 


4 


1 1 


-2 


4 


10 


3 


5 


19 


-3 


5 


18 


4 


6 


35 


-4 


6 


34 


5 


7 


67 


-5 


7 


66 


6 


8 


131 


-6 


8 


130 


7 


9 


259 


-7 


9 


258 


8 


9 


257 


-8 


9 


256 



Table F17 



Quantization level 


Code length 


Code 





3 


6 


1 


3 


5 


-1 


3 


4 


2 


3 


2 


-2 


3 


1 


3 


4 


15 


-3 


4 


14 


4 


4 


6 


-4 


4 


1 


5 


5 


14 


-5 


5 


1 


6 


6 


31 


-6 


6 


30 


7 


6 





-7 


7 


3 


8 


8 


5 


-8 


8 


4 



Table G17 



Quantization level 


Code length 


Code 





2 


2 


1 


3 


7 


-1 


3 


6 


2 


3 


1 


-2 


3 





3 


4 


5 


-3 


4 


4 


4 


5 


14 


-4 


5 


13 


5 


6 


30 


-5 


6 


25 


6 


7 


62 


-6 


7 


49 


7 


8 


127 


-7 


8 


126 


8 


8 


97 


-8 


8 


96 
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D.5.9 25 Levels 



Table A25 



Quantization level 


Code length 


Code 





3 


6 


1 


3 


4 


-1 


3 


3 


2 


3 


1 


-2 


3 





3 


4 


15 


-3 


4 


14 


4 


4 


5 


-4 


4 


4 


5 


5 


22 


-5 


5 


21 


6 


6 


47 


-6 


6 


46 


7 


7 


83 


-7 


7 


82 


8 


8 


163 


-8 


8 


162 


9 


8 


160 


-9 


9 


323 


10 


10 


644 


-10 


11 


1 291 


11 


12 


2 580 


-11 


13 


5 163 


12 


14 


10 325 


-12 


14 


10 324 


Table C25 


Quantization level 


Code length 


Code 





3 


1 


1 


4 


15 


-1 


4 


14 


2 


4 


12 


-2 


4 


11 


3 


4 


9 


-3 


4 


8 


4 


4 


6 


-4 


4 


5 


5 


4 


1 


-5 


4 





6 


5 


26 


-6 


5 


21 


7 


5 


15 


-7 


5 


14 


8 


5 


8 


-8 


6 


55 


9 


6 


41 


-9 


6 


40 


10 


6 


18 


-10 


7 


109 


11 


7 


108 


-11 


7 


39 


12 


8 


77 


-12 


8 


76 



Table B25 



Quantization level 


Code length 


Code 





3 


5 


1 


3 


2 


-1 


3 


1 


2 


4 


15 


-2 


4 


14 


3 


4 


9 


-3 


4 


8 


4 


4 


6 


-4 


4 


1 


5 


5 


26 


-5 


5 


25 


6 


5 


15 


-6 


5 


14 


7 


6 


55 


-7 


6 


54 


8 


6 


49 


-8 


6 


48 


9 


L 6 


1 


-9 


6 





10 


7 


6 


-10 


7 


5 


11 


7 


4 


-11 


8 


15 


12 


9 


29 


-12 


9 


28 


Table D25 


Quantization level 


Code length 


Code 





2 


2 


1 


3 


7 


-1 


3 


6 


2 


3 


1 


-2 


3 





3 


4 


5 


-3 


4 


4 


4 


5 


13 


-4 


5 


12 


5 


6 


29 


-5 


6 


28 


6 


7 


62 


-6 


7 


61 


7 


8 


126 


-7 


8 


121 


8 


9 


255 


-8 


9 


254 


9 


10 


483 


-9 


10 


482 


10 


11 


963 


-10 


11 


962 


11 


12 


1 923 


-11 


12 


1 922 


12 


12 


1 921 


-12 


12 


1 920 
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Table E25 



Table F25 



Quantization level 


Code length 


Code 





2 


3 


1 


3 


3 


-1 


3 


2 


2 


4 


11 


-2 


4 


10 


3 


4 


1 


-3 


4 





4 


5 


17 


-4 


5 


16 


5 


5 


5 


-5 


5 


4 


6 


6 


38 


-6 


6 


37 


7 


6 


14 


-7 


6 


13 


8 


7 


79 


-8 


7 


78 


9 


7 


72 


-9 


7 


31 


10 


7 


25 


-10 


7 


24 


11 


8 


147 


-11 


8 


146 


12 


8 


61 


-12 


8 


60 



Quantization level 


Code length 


Code 





3 


1 


1 


3 





-1 


4 


15 


2 


4 


14 


-2 


4 


13 


3 


4 


1 1 


-3 


4 


10 


4 


4 


8 


-4 


r 4 


7 


5 


4 


5 


-5 


4 


4 


6 


5 


24 


-6 


5 


19 


7 


5 


13 


-7 


5 


12 


8 


6 


37 


-8 


L 6 


36 


9 


7 


102 


-9 


7 


101 


10 


8 


207 


-10 


8 


206 


11 


8 


200 


-11 


9 


403 


12 


10 


805 


-12 


10 


804 



Table G25 



Quantization level 


Code length 


Code 





2 


1 


1 


3 


6 


-1 


3 


5 


2 


3 j 





-2 


4 


15 


3 


4 


8 


-3 


4 


3 


4 


5 


28 


-4 


5 


19 


5 


5 


4 


-5 


6 


59 


6 


6 


36 


-6 


6 


11 


7 


7 


116 


-7 


7 


75 


8 


7 


21 


-8 


7 


20 


9 


8 


149 


-9 


8 


148 


10 


9 


470 


-10 


9 


469 


11 


10 


943 


-11 


10 


942 


12 


10 


937 


-12 


10 


936 



ETSI 



219 



ETSI TS 102 114 V1. 4.1 (2012-09) 



D.5.10 33 Levels 



Table A33 



Quantization 
level 


Code 
length 


uoae 





3 


2 


1 


3 


1 


-1 


3 





2 


4 


14 


-2 


4 


13 


3 


4 


12 


-3 


4 


11 


4 


4 


9 


-4 


4 


8 


5 


4 


6 


-5 


5 


31 


6 


5 


20 


-6 


5 


15 


7 


6 


61 


-7 


6 


60 


8 


6 


29 


-8 


6 


28 


9 


7 


85 


-9 


7 


84 


10 


8 


174 


-10 


8 


173 


11 


9 


351 


-11 


9 


350 


12 


10 


691 


-12 


10 


690 


13 


11 


1 379 


-13 


11 


1 378 


14 


12 


2 755 


-14 


12 


2 754 


15 


13 


5 507 


-15 


13 


5 506 


16 


13 


5 505 


-16 


13 


5 504 



Table B33 



Quantization 


Code 


Code 


level 


length 







3 


1 


1 


4 


15 


-1 


4 


14 


2 


4 


11 


-2 


4 


10 


3 


4 


8 


-3 


4 


7 


4 


4 


4 


-4 


4 


1 


5 


5 


27 


-5 


5 


26 


6 


5 


19 


-6 


5 


18 


7 


5 


12 


-7 


5 


11 


8 


5 


1 


-8 


5 





9 


6 


50 


-9 


6 


49 


10 


6 


26 


-10 


6 


21 


11 


7 


103 


-11 


7 


102 


12 


7 


96 


-12 


7 


55 


13 


7 


41 


-13 


7 


40 


14 


8 


194 


-14 


8 


109 


15 


8 


108 


-15 


9 


391 


16 


10 


781 


-16 


10 


780 



ETSI 



Table C33 



Quantization 
Level 


Code 
Length 


lsOCI6 





4 


13 


1 


4 


11 


-1 


4 


10 


2 


4 


8 


-2 


4 


7 


3 


4 


4 


-3 


4 


3 


4 


4 


2 


-4 


4 


1 


5 


5 


30 


-5 


5 


29 


6 


5 


25 


-6 


5 


24 


7 


5 


19 


-7 


5 


18 


8 


5 


11 


-8 


5 


10 


9 


5 





-9 


6 


63 


10 


6 


62 


-10 


6 


57 


11 


6 


27 


-11 


6 


26 


12 


6 


24 


-12 


6 


3 


13 


7 


113 


-13 


7 


112 


14 


7 


50 


-14 


7 


5 


15 


7 


4 


-15 


8 


103 


16 


9 


205 


-16 


9 


204 



Table D33 



Quantization level 


Code length 


Code 





2 


1 


1 


3 


6 


-1 


3 


5 


2 


3 





-2 


4 


15 


3 


4 


8 


-3 


4 


3 


4 


5 


28 


-4 


5 


19 


5 


5 


4 


-5 


6 


59 


6 


6 


36 


-6 


6 


11 


7 


7 


116 


-7 n 


7 


75 


8 


7 


21 


-8 


7 


20 


9 


8 


149 


-9 


8 


148 


10 


9 


469 


-10 


9 


468 


11 


10 


941 


-1 1 


10 


940 


12 


1 1 


1 885 


-1 2 


1 1 


1 884 


13 


12 


3 773 


-13 


12 


3 772 


14 


13 


7 551 


-14 


13 


7 550 


15 


14 


15 099 


-15 


14 


15 098 


16 


14 


15 097 


-16 


14 


15 096 



220 ETSI TS 1 02 1 1 4 V1 .4.1 (201 2-09) 



Table E33 



Quantization level 


Code length 


Code 





2 


2 


1 


3 


2 


-1 


3 


1 


2 


4 


12 


-2 


4 


7 


3 


4 





-3 


5 


31 


4 


5 


27 


-4 


5 


26 


5 


5 


3 


-5 


5 


2 


6 


6 


59 


-6 


6 


58 


7 


6 


27 


-7 


6 


26 


8 


7 


123 


-8 


7 


122 


9 


7 


120 


-9 


7 


115 


10 


7 


112 


-10 


7 


51 


11 


7 


49 


-1 1 


7 


48 


12 


8 


242 


-1 2 


8 


229 


13 


8 


227 


-13 


8 


226 


14 


8 


101 


-14 


8 


100 


15 


9 


487 


-15 


9 


486 


16 


9 


457 


-16 


9 


456 



ETSI 



Table F33 



Quantization level 


Code length 


Code 





4 


13 


1 


4 


12 


-1 


4 


1 1 


2 


4 


9 


-2 


4 


8 


3 


4 


7 


-3 


4 


6 


4 


4 


4 


-4 


4 


3 


5 


4 


1 


-5 


4 





6 


5 


30 


-6 


5 


29 


7 


5 


21 


-7 


5 


20 


8 


5 


10 


-8 


5 


5 


9 


6 


63 


-9 


6 


62 


10 


6 


56 


-10 


6 


23 


1 1 


6 


9 


-11 


6 


8 


12 


7 


45 


-12 


7 


44 


13 


8 


230 


-13 


8 


229 


14 


9 


463 


-14 


9 


462 


15 


9 


456 


-15 


10 


915 


16 


11 


1 829 


-16 


11 


1 828 



221 ETSI TS 1 02 1 1 4 V1 .4.1 (201 2-09) 



Table G33 



Quantization level 


Code length 


Code 





3 


6 


1 


3 


3 


-1 


3 


2 


2 


4 


15 


-2 


4 


14 


3 


4 


9 


-3 


4 


8 


4 


4 


1 


-4 


4 





5 


5 


22 


-5 


5 


21 


6 


5 


6 


-6 


5 


5 


7 


6 


46 


-7 


6 


41 


8 


6 


14 


-8 


6 


L 9 


9 


7 


94 


-9 


7 


81 


10 


7 


30 


-10 


7 


17 


1 1 


8 


191 


-11 


8 


190 


12 


8 


63 


-12 


8 


62 


13 


8 


32 


-13 


9 


323 


14 


9 


321 


-14 


9 


320 


15 


9 


67 


-15 


9 


66 


16 


10 


645 


-16 


10 


644 



ETSI 



222 



ETSI TS 102 114 V1. 4.1 (2012-09) 



D.5.11 65 Levels 



Quantization 


Code 


Code 


level 


length 







4 


6 


1 


4 


5 


-1 


4 


4 


2 


4 


2 


-2 


4 


1 


3 


4 





-3 


5 


31 


4 


5 


29 


-4 


5 


28 


5 


5 


27 


-5 


5 


26 


6 


5 


24 


-6 


5 


23 


7 


5 


21 


-7 


5 


20 


8 


5 


18 


-8 


5 


17 


9 


5 


14 


-9 


5 


7 


10 


5 


6 


-10 


6 


61 


11 


6 


50 



Quantization 


Code 


Code 


level 


length 







4 


4 


1 


4 


2 


-1 


4 


1 


2 


5 


30 


-2 


5 


29 


3 


5 


26 


-3 


5 


25 


4 


5 


23 


-4 


5 


22 


5 


5 


19 


-5 


5 


18 


6 


5 


16 


-6 


5 


15 


7 


5 


12 


-7 


5 


11 


8 


5 


7 


-8 


5 


6 


9 


6 


63 


-9 


6 


62 


10 


6 j 


56 


-10 


6 


55 


11 


6 


49 


-11 


6 


48 



Table A65 



Quantization 
level 


Code 
length 


Code 


-11 


6 


45 


12 


6 


38 


-12 


6 


33 


13 


6 


31 


-13 


6 


30 


14 


7 


120 


-14 


7 


103 


15 


7 


89 


-15 


7 


88 


16 


7 


65 


-16 


7 


64 


17 


8 


205 


-17 


8 


204 


18 


8 


157 


-18 


8 


156 


19 


9 


486 


-19 


9 


485 


20 


9 


318 


-20 


9 


317 


21 


10 


975 


-21 


10 


974 


22 


10 


639 


Table B65 


Quantization 
level 


Code 
length 


Code 


12 


6 


41 


-12 


6 


40 


13 


6 


34 


-13 


6 


29 


14 


6 


26 


-14 


6 


21 


15 


6 


20 


-15 


6 


3 


16 


6 





-16 


7 


115 


17 


7 


109 


-17 


7 


108 


18 


7 


86 


-18 


7 


85 


19 


7 


70 


-19 


7 


57 


20 


7 


56 


-20 


7 


55 


21 


7 


4 


-21 


7 


3 


22 


8 


229 


-22 


8 


228 


23 


8 


175 



Quantization 


Code 


Code 


level 


length 




-22 


10 


638 


23 


11 


1 939 


-23 


11 


1 938 


24 


11 


1 936 


-24 


11 


1 267 


25 


11 


1 264 


-25 


12 


3 875 


26 


12 


2 532 


-26 


12 


2 531 


27 


13 


7 749 


-27 


13 


7 748 


28 


13 


5 061 


-28 


13 


5 060 


29 


14 


10 133 


-29 


14 


10 132 


30 


15 


20 269 


-30 


15 


20 268 


31 


16 


40 543 


-31 


16 


40 542 


32 


16 


40 541 


-32 


16 


40 540 



Quantization 


Code 


Code 


level 


length 




-23 


8 


174 


24 


8 


143 


-24 


8 


142 


25 


8 


108 


-25 


8 


11 


26 


8 


10 


-26 


8 


5 


27 


9 


339 


-27 


9 


338 


28 


9 


336 


-28 


9 


219 


29 


9 


9 


-29 


9 


8 


30 


10 


674 


-30 


10 


437 


31 


10 


436 


-31 


11 


1 351 


32 


12 


2 701 


-32 


12 


2 700 



ETSI 



223 



ETSI TS 102 114 V1. 4.1 (2012-09) 



Quantization 


Code 




Code 


■ . . „ i 

level 


length 









5 


28 


1 


5 


25 


-1 


5 


24 


2 


5 


23 


-2 


5 


22 


3 


5 


19 


-3 


5 


18 


4 


5 


16 


-4 


5 


15 


5 


5 


13 


-5 


5 


12 


6 


5 


10 


-6 


5 


9 


7 


5 


7 


-7 


5 


6 


8 


5 


4 


-8 


5 


3 


9 


5 


1 


-9 


5 





10 


6 


62 


-10 


6 


61 


11 


6 


59 




Quantization 


Code 


Code 


level 


length 









3 


4 


1 


3 


1 


-1 


3 





2 


4 


13 


-2 


4 


12 


3 


4 


7 


-3 


4 


6 


4 


5 


31 


-4 


5 


30 


5 


5 


23 


-5 


5 


22 


6 


5 


11 


-6 


5 


10 


7 


6 


59 


-7 


6 


58 


8 


6 


43 


-8 


6 


42 


9 


6 


19 


-9 


6 


18 


10 


7 


115 


-10 


7 


114 



Table C65 



Quantization 


Code 




Code 


level 


length 






-1 1 


6 


58 


12 


6 


54 


-12 


6 


53 


13 


6 


43 


-13 


6 


42 


14 


6 


40 


-14 


6 


35 


15 


6 


29 


-15 


6 


28 


16 


6 


17 


-16 


6 


16 


17 


6 


11 


-17 


6 


10 


18 


6 


4 


-18 


7 


127 


19 


7 


121 


-19 


7 


120 


20 


7 


110 


-20 


7 


105 


21 


7 


83 


-21 


7 


82 


22 


7 


68 


Table D65 


Quantization 


Code 


Code 


level 


length 






11 


7 


83 


-11 


7 


82 


12 


7 


35 


-12 


7 


34 


13 


8 


227 


-13 


8 


226 


14 


8 


163 


-14 


8 


162 


15 


8 


160 


-15 


8 


67 


16 


8 


64 


-16 


9 


451 


17 


9 


448 


-17 


9 


323 


18 


9 


132 


-18 


9 


131 


19 


10 


900 


-19 


10 


899 


20 


10 


644 


-20 


10 


267 


21 


10 


261 



Quantization 
level 


Code 
length 


Code 


-22 


7 


47 


23 


7 


46 


-23 


7 


45 


24 


7 


1 1 


-24 


7 


10 


25 


8 


252 


-25 


8 


223 


26 


8 


209 


-26 


8 


208 


27 


8 


138 


-27 


8 


89 


28 


8 


88 


-28 


9 


507 


29 


9 


445 


-29 


9 


444 


30 


9 


278 


-30 


10 


1013 


31 


10 


1012 


-31 


10 


559 


32 


11 


1117 


-32 


11 


1116 



Quantization 


Code 


Code 


level 


length 




-21 


10 


260 


22 


L 11 


1 797 


-22 


11 


1 796 


23 


11 


533 


-23 


11 


532 


24 


12 


3 605 


-24 


12 


3 604 


25 


12 


2 582 


-25 


12 


2 581 


26 


13 


7 215 


-26 


13 


7 214 


27 


13 


5 167 


-27 


13 


5 166 


28 


13 


5 160 


-28 


14 


14 427 


29 


14 


10 323 


-29 


14 


10 322 


30 


15 


28 853 


-30 


15 


28 852 



ETSI 



224 



ETSI TS 102 114 V1. 4.1 (2012-09) 



Quantization 


Code 




Code 


■ . . „ i 

level 


length 









3 


4 


1 


3 





-1 


4 


15 


2 


4 


7 


-2 


4 


6 


3 


5 


29 


-3 


5 


28 


4 


5 


23 


-4 


5 


22 


5 


5 


10 


-5 


5 


9 


6 


5 


6 


-6 


5 


5 


7 


6 


54 


-7 


6 


53 


8 


6 


48 


-8 


6 


43 


9 


6 


40 


-9 


6 


23 


10 


6 


16 


-10 


6 


15 


11 


6 


9 




Quantization 


Code 


Code 


level 


length 









3 


6 


1 


3 


3 


-1 


3 


2 


2 


4 


15 


-2 


4 


14 


3 


4 


9 


-3 


4 


8 


4 


4 


1 


-4 


4 





5 


5 


21 


-5 


5 


20 


6 


5 


5 


-6 


5 


4 


7 


6 


45 


-7 


6 


44 


8 


6 


13 


-8 


6 


12 


9 


7 


93 


-9 


7 


92 


10 


7 


29 


-10 


7 


28 


11 


8 


189 



Table E65 



Quantization 


Code 




Code 


level 


length 






-1 1 


6 


8 


12 


7 


105 


-12 


7 


104 


13 


7 


100 


-13 


7 


99 


14 


7 


84 


-14 


7 


83 


15 


7 


45 


-15 


7 


44 


16 


7 


29 


-16 


7 


28 


17 


8 


221 


-17 


8 


220 


18 


8 


206 


-18 


8 


205 


19 


8 


202 


-19 


8 


197 


20 


8 


171 


-20 


8 


170 


21 


8 


164 


-21 


8 


71 


22 


8 


69 


Table F65 


Quantization 


Code 


Code 


level 


length 






-11 


8 


188 


12 


8 


61 


-12 


8 


60 


13 


9 


381 


-13 


9 


380 


14 


9 


125 


-14 


9 


124 


15 


10 


765 


-15 


10 


764 


16 


10 


252 


-16 


11 


1 535 


17 


11 


1 532 


-17 


11 


511 


18 


11 


506 


-18 


12 


3 069 


19 


12 


3 067 


-19 


12 


3 066 


20 


12 


1 015 


-20 


12 


1 014 


21 


13 


6 136 


-21 


13 


2 043 


22 


13 


2 035 



Quantization 
level 


Code 
length 


Code 


-22 


8 


68 


23 


9 


446 


-23 


9 


445 


24 


9 


415 


-24 


9 


414 


25 


9 


408 


-25 


9 


407 


26 


9 


393 


-26 


9 


392 


27 


9 


331 


-27 


9 


330 


28 


9 


141 


-28 


9 


140 


29 


10 


895 


-29 


10 


894 


30 


10 


889 


-30 


10 


888 


31 


10 


819 


-31 


10 


818 


32 


10 


813 


-32 


10 


812 



Quantization 


Code 


Code 


level 


length 




-22 


13 


2 034 


23 


L 14 


12 275 


-23 


14 


12 274 


24 


14 


4 085 


-24 


14 


4 084 


25 


14 


4 083 


-25 


14 


4 082 


26 


14 


4 081 


-26 


14 


4 080 


27 


14 


4 079 


-27 


14 


4 078 


28 


14 


4 077 


-28 


14 


4 076 


29 


14 


4 075 


-29 


14 


4 074 


30 


14 


4 073 


-30 


14 


4 072 


31 


14 


4 067 


-31 


14 


4 066 


32 


14 


4 065 


-32 


14 


4 064 



ETSI 



225 



ETSI TS 102 114 V1.4.1 (2012-09) 



Quantization 


Code 


Code 


■ . . „ i 

level 


length 







4 


14 


1 


4 


1 1 


-1 


4 


10 


2 


4 


8 


-2 


4 


6 


3 


4 


4 


-3 


4 


3 


4 


4 





-4 


5 


31 


5 


5 


26 


-5 


5 


25 


6 


5 


18 


-6 


5 


15 


7 


5 


10 


-7 


5 


5 


8 


5 


2 


-8 


6 


61 


9 


6 


54 


-9 


6 


49 


10 


6 


38 


-10 


6 


29 


11 


6 


22 



Table G65 



Quantization 


Code 


Code 


level 


length 




-1 1 


6 


9 


12 


6 


6 


-12 


7 


121 


13 


7 


110 


-13 


7 


97 


14 


7 


78 


-14 


7 


57 


15 


7 


46 


-15 


7 


17 


16 


7 


14 


-16 


8 


241 


17 


8 


223 


-17 


8 


222 


18 


8 


159 


-18 


8 


158 


19 


8 


95 


-19 


8 


94 


20 


8 


31 


-20 


8 


30 


21 


9 


480 


-21 


9 


387 


22 


9 


384 



Quantization 
level 


Code 
length 


Code 


-22 


9 


227 


23 


9 


225 


-23 


9 


224 


24 


9 


65 


-24 


9 


64 


25 


10 


962 


-25 


10 


773 


26 


10 


771 


-26 


10 


770 


27 


10 


452 


-27 


10 


135 


28 


10 


133 


-28 


10 


1 32 


29 


11 


1 927 


-29 


11 


1 926 


30 


11 


1 545 


-30 


11 


1 544 


31 


11 


907 


-31 


11 


906 


32 


11 


269 


-32 


11 


268 



ETSI 



226 



ETSI TS 102 114 V1. 4.1 (2012-09) 



D.5.12 129 Levels 



Quantization 


Code 


Code 


level 


length 







2 


1 


1 


3 


6 


-1 


3 


5 


2 


3 





-2 


4 


15 


3 


4 


8 


-3 


4 


3 


4 


5 


28 


-4 


5 


19 


5 


5 


4 


-5 


6 


59 


6 


6 


36 


-6 


6 


11 


7 


7 


75 


-7 


7 


74 


8 


8 


233 


-8 


8 


232 


9 


8 


41 


-9 


8 


40 


10 


9 


87 


-10 


9 


86 


11 


10 


937 


-11 


10 


936 


12 


11 


1 877 


-12 


11 


1 876 


13 
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D.6 Block Code Books 
D.6.1 3 Levels 

Table V.3: 3-level 4-element 7-bit Block Code Book 



Level index 


Code for 1 SI element 


-1 








1 


1 


2 


Level index 


Code for 2 na element 


-1 








3 


1 


6 


Level index 


Code for 3™ element 


-1 








9 


1 


18 


Level index 


Code for 4'" element 


-1 








27 


1 


54 



D.6.2 5 Levels 

Table V.5: 5-level 4-element 10-bit Block Code Book 



Level index 


Code for 1 SI element 


-2 





-1 


1 





2 


1 


3 


2 


4 


Level index 


Code for 2 na element 


-2 





-1 


5 





10 


1 


15 


2 


20 


Level index 


Code for 3™ element 


-2 





-1 


25 





50 


1 


75 


2 


100 


Level index 


Code for 4 in element 


-2 





-1 


125 





250 


1 


375 


2 


500 
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D.6.3 7 Levels 

Table V.7: 7-level 4-element 12-bit Block Code Book 



Level index 


Code for 1 SI element 


-3 





-2 


1 


-1 


2 





3 


1 


4 


2 


5 


3 


6 


Level index 


Code for 2 na element 


-3 





-2 


7 


-1 


14 





21 


1 


28 


2 


35 


3 


42 


Level index 


Code for 3 ra element 


-3 





-2 


49 


-1 


98 





47 


1 


196 


2 


245 


3 


294 


Level index 


Code for 4 in element 


-3 





-2 


343 


-1 


686 





1 029 


1 


1 372 


2 


1 715 


3 


2 058 



D.6.4 9 Levels 



Table V.9: 9-level 4-element 13-bit Block Code Book 



Level index 


Code for 1 st element 


-4 





-3 


1 


-2 


2 


-1 


3 





4 


1 


5 


2 


6 


3 


7 


4 


8 


Level index 


Code for 2 na element 


-4 





-3 


9 


-2 


18 


-1 


27 





36 


1 


45 


2 


54 


3 


63 


4 


72 



Level index 


Code for 3™ element 


-4 





-3 


81 


-2 


162 


-1 


243 





324 


1 


405 


2 


486 


3 


567 


4 


648 


Level index 


Code for 4 1 " element 


-4 





-3 


729 


-2 


1 458 


-1 


2 187 





2916 


1 


3 645 


2 


4 374 


3 


5 103 


4 


5 832 
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D.6.5 13 Levels 



Table V.13: 13-level 4-element 15-bit block 



Level index 


Code for 1 SI element 


-6 





-5 


1 


-4 


2 


-3 


3 


-2 


4 


-1 


5 





6 


1 


7 


2 


8 


3 


9 


4 


10 


5 


11 


6 


12 


Level index 


Code for 2 na element 


-6 





-5 


13 


-4 


26 


-3 


39 


-2 


52 


-1 


65 





78 


1 


91 


2 


104 


3 


117 


4 


130 


5 


143 


6 


156 



Level index 


Code for 3™ element 


-6 





-5 


169 


-4 


338 


-3 


507 


-2 


676 


-1 


845 





1 014 


1 


1 183 


2 


1 352 


3 


1 521 


4 


1 690 


5 


1 859 


6 


2 028 


Level index 


Code for 4 tn element 


-6 





-5 


2 197 


-4 


4 394 


-3 


6 591 


-2 


8 788 


-1 


10 985 





13 182 


1 


15 379 


2 


17 576 


3 


19 773 


4 


21 970 


5 


24 167 


6 


26 364 
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6 1 7 Levels 



Table V.17: 17-level 4-element 17-bit Block Code Book 



Level index 


Code for 1 SI element 


-8 





-7 


1 


-6 


2 


-5 


3 


-4 


4 


-3 


5 


-2 


6 


-1 


7 





8 


1 


9 


2 


10 


3 


11 


4 


12 


5 


13 


6 


14 


7 


15 


8 


16 


Level index 


Code for 2 na element 


-8 





-7 


17 


-6 


34 


-5 


51 


-4 


68 


-3 


85 


-2 


102 


-1 


119 





136 


1 


153 


2 


170 


3 


187 


4 


204 


5 


221 


6 


238 


7 


255 


8 


272 



Level index 


Code for 3™ element 


-8 





-7 


289 


-6 


578 


-5 


867 


-4 


1 156 


-3 


1 445 


-2 


1 734 


-1 


2 023 





2 312 


1 


2 601 


2 


2 890 


3 


3 179 


4 


3 468 


5 


3 757 


6 


4 046 


7 


4 335 


8 


4 624 


Level index 


Code for 4'" element 


-8 





-7 


4 913 


-6 


9 826 


-5 


14 739 


-4 


19 652 


-3 


24 565 


-2 


29 478 


-1 


34 391 





39 304 


1 


44 217 


2 


49 130 


3 


54 043 


4 


58 956 


5 


63 869 


6 


68 782 


7 


73 695 


8 


78 608 
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7 25 Levels 



Table V.25: 25-level 4-element 19-bit Block Code Book 



Level index 


Code for 1 SI element 


-12 





-11 


1 


-10 


2 


-9 


3 


-8 


4 


-7 


5 


-6 


6 


-5 


7 


-4 


8 


-3 


9 


-2 


10 


-1 


11 





12 


1 


13 


2 


14 


3 


15 


4 


16 


5 


17 


6 


18 


7 


19 


8 


20 


9 


21 


10 


22 


11 


23 


12 


24 


Level index 


Code for 2™ element 


-12 





-11 


25 


-10 


50 


-9 


75 


-8 


100 


-7 


125 


-6 


150 


-5 


175 


-4 


200 


-3 


225 


-2 


250 


-1 


275 





300 


1 


325 


2 


350 


3 


375 


4 


400 


5 


425 


6 


450 


7 


475 


8 


500 


9 


525 


10 


550 


11 


575 


12 


600 



Level index 


Code for 3™ element 


-12 





-11 


625 


-10 


1 250 


-9 


1 875 


-8 


2 500 


-7 


3 125 


-6 


3 750 


-5 


4 375 


-4 


5 000 


-3 


5 625 


-2 


6 250 


-1 


6 875 





7 500 


1 


8 125 


2 


8 750 


3 


9 375 


4 


10 000 


5 


10 625 


6 


11 250 


7 


11 875 


8 


12 500 


9 


13 125 


10 


13 750 


11 


14 375 


12 


15 000 


Level index 


Code for 4'" element 


-12 





-11 


15 625 


-10 


31 250 


-9 


46 875 


-8 


62 500 


-7 


78 125 


-6 


93 750 


-5 


1 09 375 


-4 


125 000 


-3 


140 625 


-2 


1 56 250 


-1 


171 875 





187 500 


1 


203 125 


2 


218 750 


3 


234 375 


4 


250 000 


5 


265 625 


6 


281 250 


7 


296 875 


8 


312 500 


9 


328 125 


10 


343 750 


11 


359 375 


12 


375 000 
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D.7 Interpolation FIR 



Index 


FIR - 2 x 


FIR - 4 x 




Interpolation 


Interpolation 





3,305240000e-06 


2,107630000e-06 


1 


-1,095500000e-07 


1, 09481 0000e-05 


2 


-1,133348000e-05 


2,290807000e-05 


3 


-5,509460000e-06 


2,839700000e-05 


4 


2,381 930000e-05 


1 ,428398000e-05 


5 


2,278368000e-05 


-2,752976000e-05 


6 


-3,684078000e-05 


-8,951 150000e-05 


7 


-5,886791 000e-05 


-1, 4279621 00e-04 


8 


4,053684000e-05 


-1, 435831 500e-04 


9 


1,1 868291 00e-04 


-5,40861 3000e-05 


10 


-1 ,809484000e-05 


1, 283221 800e-04 


11 


-2,002544900e-04 


3,488978300e-04 


12 


-5,2991 83000e-05 


4,98251 0600e-04 


13 


2,892986200e-04 


4,505801 800e-04 


14 


1 ,963655800e-04 


1 ,306004900e-04 


15 


-3,546474000e-04 


-4,182235900e-04 


16 


-4,285478200e-04 


-1, 004001 470e-03 


17 


3,466888200e-04 


-1 ,328857730e-03 


18 


7,476581 400e-04 


-1,108668630e-03 


19 


-2,011064500e-04 


-2,3321 88200e-04 


20 


-1,121123670e-03 


1,1 065321 70e-03 


21 


-1, 503691 300e-04 


2,405677920e-03 


22 


1 ,475675030e-03 


2,994750860e-03 


23 


7,618262300e-04 


2,32767221 Oe-03 


24 


-1 ,693736650e-03 


3,03061 91 00e-04 


25 


-1, 6492671 60e-03 


-2,537531 080e-03 


26 


1 ,620259490e-03 


-5,075346680e-03 


27 


2,764807080e-03 


-5,991 244690e-03 


28 


-1,082837000e-03 


-4,355599640e-03 


29 


-3,974851 220e-03 


-1,972305800e-04 


30 


-7,440893000e-05 


5,2372361 50e-03 


31 


5,049238450e-03 


9,746226480e-03 


32 


1, 94591 0740e-03 


1, 09691 4243e-02 


33 


-5,668033380e-03 


7,467648480e-03 


34 


-4,514896780e-03 


-3,564674400e-04 


35 


5,450623580e-03 


-9,986897000e-03 


36 


7,607854900e-03 


-1 ,744846255e-02 


37 


-4,0081 48330e-03 


-1,880371 01 2e-02 


38 


-1,086365897e-02 


-1,1 9831 8321 e-02 


39 


1,01 561 4490e-03 


1 ,828493200e-03 


40 


1, 37270391 Oe-02 


1 ,799243502e-02 


41 


3,704760920e-03 


2,975338697e-02 


42 


-1,547267288e-02 


3,083455376e-02 


43 


-1,010103151e-02 


1 ,837555505e-02 


44 


1 ,526044402e-02 


-5,042277280e-03 


45 


1 ,782309450e-02 


-3,1 3799381 3e-02 


46 


-1,221452747e-02 


-4,9549631 77e-02 


47 


-2,617896535e-02 


-4,967092723e-02 


48 


5,509705280e-03 


-2,767589502e-02 


49 


3,411839902e-02 


1,166744903e-02 


50 


5,563403950e-03 


5,492079630e-02 


51 


-4,02300581 3e-02 


8,387579024e-02 


52 


-2,160109766e-02 


8,227037638e-02 


53 


4,270342737e-02 


4,309020936e-02 


54 


4,324966297e-02 


-2,63743251 6e-02 


55 


-3,911506757e-02 


-1,040880680e-01 


56 


-7,177370787e-02 


-1,583629698e-01 


57 


2,557834797e-02 


-1,573987603e-01 


58 


1,1 09383851 e-01 


-8,037899435e-02 



Index 


FIR-2x 


FIR-4x 




Interpolation 


Interpolation 


59 


6,777029020e-03 


7,367454469e-02 


60 


-1,752302796e-01 


2,826547325e-01 


61 


-9,289701 283e-02 


5,053876638e-01 


62 


3,630628884e-01 


6,92141 1753e-01 


63 


8,234865069e-01 


7,985475659e-01 


64 


8,234865069e-01 


7,985475659e-01 


65 


3,630628884e-01 


6,92141 1753e-01 


66 


-9,289701 283e-02 


5,053876638e-01 


67 


-1,752302796e-01 


2,826547325e-01 


68 


6,777029020e-03 


7,367454469e-02 


69 


1,1 09383851 e-01 


-8,037899435e-02 


70 


2,557834797e-02 


-1,573987603e-01 


71 


-7,177370787e-02 


-1,583629698e-01 


72 


-3,911506757e-02 


-1,040880680e-01 


73 


4,324966297e-02 


-2,63743251 6e-02 


74 


4,270342737e-02 


4,309020936e-02 


75 


-2,160109766e-02 


8,227037638e-02 


76 


-4,02300581 3e-02 


8,387579024e-02 


77 


5,563403950e-03 


5,492079630e-02 


78 


3,411839902e-02 


1,166744903e-02 


79 


5,509705280e-03 


-2,767589502e-02 


80 


-2,617896535e-02 


-4,967092723e-02 


81 


-1,221452747e-02 


-4,9549631 77e-02 


82 


1 ,782309450e-02 


-3,1 3799381 3e-02 


83 


1 ,526044402e-02 


-5,042277280e-03 


84 


-1,01 01 031 51 e-02 


1 ,837555505e-02 


85 


-1,547267288e-02 


3,083455376e-02 


86 


3,704760920e-03 


2,975338697e-02 


87 


1, 37270391 0e-02 


1 ,799243502e-02 


88 


1,01 561 4490e-03 


1 ,828493200e-03 


89 


-1,086365897e-02 


-1,1 9831 8321 e-02 


90 


-4,0081 48330e-03 


-1,880371 01 2e-02 


91 


7,607854900e-03 


-1 ,744846255e-02 


92 


5,450623580e-03 


-9,986897000e-03 


93 


-4,514896780e-03 


-3,564674400e-04 


94 


-5,668033380e-03 


7,467648480e-03 


95 


1, 94591 0740e-03 


1, 09691 4243e-02 


96 


5,049238450e-03 


9,746226480e-03 


97 


-7,440893000e-05 


5,2372361 50e-03 


98 


-3,974851 220e-03 


-1,972305800e-04 


99 


-1,082837000e-03 


-4,355599640e-03 


100 


2,764807080e-03 


-5,991 244690e-03 


101 


1 ,620259490e-03 


-5,075346680e-03 


102 


-1, 6492671 60e-03 


-2,537531 080e-03 


103 


-1,693736650e-03 


3,03061 91 00e-04 


104 


7,618262300e-04 


2,32767221 0e-03 


105 


1 ,475675030e-03 


2,994750860e-03 


106 


-1, 503691 300e-04 


2,405677920e-03 


107 


-1,121 123670e-03 


1,1 065321 70e-03 


108 


-2,011064500e-04 


-2,3321 88200e-04 


109 


7,476581 400e-04 


-1,108668630e-03 


110 


3,466888200e-04 


-1,328857730e-03 


111 


-4,285478200e-04 


-1, 004001 470e-03 


112 


-3,546474000e-04 


-4,182235900e-04 


113 


1 ,963655800e-04 


1 ,306004900e-04 


114 


2,892986200e-04 


4,505801 800e-04 


115 


-5,2991 83000e-05 


4,98251 0600e-04 


116 


-2,002544900e-04 


3,488978300e-04 


117 


-1,809484000e-05 


1, 283221 800e-04 
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Inrlov 

II IUCA 


FIR - 9 y 
rin c. a 


FIR - d y 
rin t a 




Interpolation 


Interpolation 


118 


1,1 868291 00e-04 


-5,40861 3000e-05 


119 


4,053684000e-05 


-1, 435831 500e-04 


120 


-5,886791 000e-05 


-1, 4279621 00e-04 


121 


-3,684078000e-05 


-8,951 150000e-05 


122 


2,278368000e-05 


-2,752976000e-05 



Inrlov 

II IUCA 


FIR . 9 y 
rin £. a 


FIR - 4 y 




Interpolation 


Interpolation 


123 


2,381 930000e-05 


1 ,428398000e-05 


124 


-5,509460000e-06 


2,839700000e-05 


125 


-1,133348000e-05 


2,290807000e-05 


126 


-1,095500000e-07 


1, 09481 0000e-05 


127 


3,305240000e-06 


2,107630000e-06 



D.8 32-Band Interpolation and LFE Interpolation FIR 



Index 


32-Band Interpolation FIR 


LFE Interpolation FIR 


Perfect Reconstruction 


Non-Perfect Reconstruction 


64 x Interpolation 


128 x Interpolation 





1,140033200e-10 


-1,3901 91 784e-07 


2,658434387e-04 


5,31 68571 00e-04 


1 


7, 1 387421 00e-11 


-1 ,693738625e-07 


8,179365250e-05 


1, 6358691 00e-04 


2 


-8,358679600e-09 


-2,030677564e-07 


9,43932391 2e-05 


1 ,887860900e-04 


3 


-2,529296600e-08 


-2,404238444e-07 


1,082170274e-04 


2,164336300e-04 


4 


-9,130198800e-08 


-2,81 81 4351 4e-07 


1, 233371 440e-04 


2,466738200e-04 


5 


-2,771 560000e-07 


-3,2766891 42e-07 


1 ,397485757e-04 


2,794966000e-04 


6 


-5,7461 47600e-07 


-3,784752209e-07 


1, 57595801 3e-04 


3,151909600e-04 


7 


-3,712986200e-07 


-4,347855338e-07 


1 ,769922383e-04 


3,539837500e-04 


8 


-4,468735700e-07 


-4,97227631 5e-07 


1,981738606e-04 


3,9634691 00e-04 


9 


-5,697322600e-07 


-5,6651 20852e-07 


2,211 84731 3e-04 


4,423685900e-04 


10 


-6,300390500e-07 


-6,434325428e-07 


2,460231 190e-04 


4,920452500e-04 


11 


-6,67781 8900e-07 


-7,288739425e-07 


2,7261 15927e-04 


5,452220800e-04 


12 


-6,770656500e-07 


-8,2381 64355e-07 
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O OO A OO A "7 A A z. O *1 

2,204804700e-04 


a /I / OAOACH H « A A 

-6,44828251 1e-04 


O AO/ITH AAAAn OH 

2,68451 6609e-01 


C AAAAAA H AAz. OH 

5,369022489e-01 


ohq 


O A7AAAO /nA- o ~7 

-2,272228400e-07 


A AAA/1A/1 H rA« O/l 

-9,3224941 52e-04 


O "7 A A A A H A77„ OH 

2,709231 377e-01 


C H H O/ICAAAAn AH 

5,418450832e-01 


OC\A 

dun- 


O OOAAAACAA^. o >1 

-2,380696500e-04 


H A0007iH H A« OO 

-1 ,2323741 1 0e-03 


O ^OOAOOAOOn OH 

2,733682692e-01 


C /(A70C0/IA0n AH 

5,467353463e-01 


one 
duo 


A AOOAAAOOOn O >1 

-4,930996000e-04 


H (7 /I H AAAA7A^ OO 

-1 ,544908970e-03 


O TA^OAA^AAn OH 

2,757860720e-01 


C J7H /7~7H AH H (7/. AH 

5,515710115e-01 


OC\P. 
d\JO 


—j ACOOOOOOOn O >1 

-7,653038000e-04 


H OAArH7AH H « OO 

-1 ,86951761 1 e-03 


O ~7AH 7rrAAA« OH 

2,781 755328e-01 


C J7AA /1AAAAO/. AH 

5,563499928e-01 


OCV7 

d\)f 


H AC/ITOOAAAn OO 

-1 ,054538000e-03 


O AA(70AAAA A ^ OO 

-2,205822384e-03 


O AATOCAmA^ OH 

2,805356979e-01 


r- OH A7AAAA /I z. AH 

5,610702634e-01 


ono 


-i OAOCHAOOOn OO 

-1 ,36051 9200e-03 


o rcoiinoo^On oo 

-2,553403843e-03 


O AAAArrOOA/. OH 

2,828655839e-01 


C AC7AAAH AH z. AH 

5,6572991 61 e-01 


onQ 


H AOOOOOOOOn OO 

-1 ,683383000e-03 


OOHHOOHOOO** OO 

-2,91 1801683e-03 


O OCHA/lOOOOn OH 

2,851 640880e-01 


r~ -"7AAAAAAAA /. OH 

5,703269839e-01 


01 n 


O AOOAH / AAA/. OO 

-2,02261 4600e-03 


O AOACH A A H O « OO 

-3,28051 461 8e-03 


O A7/IAAA /1AAn OH 

2,874303460e-01 


C 7/IOrAd /lAOn AH 

5,748594403e-01 


dA 1 


O A77AAACAA« OO 

-2,377899500e-03 


O A(7AAAAA H A« OO 

-3,659002949e-03 


O OAAA007/I /In OH 

2,896633744e-01 


c lAoorrrAA/. ah 

5,793255568e-01 


010 
ill il 


O 7i(07A77AAn OO 

-2,748797700e-03 


A A /I AAAAAAT^ OO 

-4,046686925e-03 


O OH AAAH OOOn OH 

2,91 8621 898e-01 


C AA7AA H A ~7 /7 /. AH 

5,837231 875e-01 


01 Q 


O HOd7A7CAA« OO 

-3,134797500e-03 


/I /I H AACAA^ C« OO 

-4,442950245e-03 


O O/lOOCOOOOn OH 

2,940258980e-01 


C AOA(7A(7 /I /1A/. AH 

5,880505443e-01 


Oi A 
ill 4 


O COCOOAOOOn OO 

-3,535329200e-03 


A O il 7H ylOOOO— OO 

-4,8471 40983e-03 


O OAHCO/lOCOn OH 

2,961 534858e-01 


C AAAAC77AC /. AH 

5,923057795e-01 


ill 


O A^ A7A /I AAA^ OO 

-3,949734800e-03 


C A(7A/7~7AOAH _ OO 

-5,258570891 e-03 


A AAA/1 A -i HA~7z. OH 

2,982441 187e-01 


C AA/lOAAOAHn AH 

5,964869261 e-01 


ill 


A A77AA H AAAn OO 

-4,377291 000e-03 


C A ~7 A /7 H A A A A OO 

-5,67651 8660e-03 


O AAAAA7TOA/. OH 

3,002967536e-01 


A AA/7AAAH r A/. AH 

6,0059231 52e-01 


01 7 
ill / 


A OH "7 H AAAAAn OO 

-4,81 71 22000e-03 


A HOOOOOA/lyl« OO 

-6,1 00233644e-03 


O AAAH AA~7(7 A ^ OH 

3,0231 06754e-01 


A A/1AAAH I" A7/. AH 

6,046201 587e-01 


01 Q 
ill 


C AAOC/I AOAAn OO 

-5,268542300e-03 


A rAAAOOAA7/. OO 

-6,528933067e-03 


O O/lOO/lOOO/ln OH 

3,042849004e-01 


A AOITAO/tOOAn AH 

6,085684896e-01 


01 Q 

iii y 


C 7AA A TAAAA^ OO 

-5,730478300e-03 


A A A H OA7AAA« OO 

-6,961 807609e-03 


O AAAH OCO/l7n OH 

3,0621 85347e-01 


A H A/1A(7AH "7-7/. AH 

6,1 243581 77e-01 


OOn 
ililU 


O OAOAACHAAn OO 

-6,2020051 00e-03 


—f OOOOOOOOO** OO 

-7,398022339e-03 


O OOHHAOAO/ln OH 

3,081 108034e-01 


A H AAAAAA/7 A /. AH 

6,162202954e-01 


001 
ilill 


O AOHAOAOOOn OO 

-6,681 936000e-03 


—f OOA7H AA^ /In OO 

L -7,83671 9044e-03 


O OOAAAOHAOn OH 

3,0996081 23e-01 


A HAAOOOCOOn AH 

6,199202538e-01 


OOO 
ddd 


-7 -4 A 7A H dCAAn OO 

-7,1 6791 4500e-03 


O A77AH AAOOn OO 

-8,27701 6692e-03 


O H H 7A7AA7A ^ OH 

3,1 1 7676973e-01 


A AATO A H ylOAn AH 

6,235341 430e-01 


OOQ 
ddo 


-7 AAAA/ICCAAn OO 

-7,662045500e-03 


O ~7H AAH AA(7H ^ OO 

-8,71 801 9351 e-03 


O H OCAAAAA7« AH 

3,135308027e-01 


A A7AAAAA A A/. AH 

6,270602942e-01 


ddQ 


O H AOOOAOOOn OO 

-8,160839200e-03 


O HCOOHHCOOn OO 

-9,1 5881 1532e-03 


O H rOi(AH7/IOn OH 

3,1 52491 748e-01 


A OA ^ A7AAO /I /. AH 

6,304970384e-01 


ililO 


O AAOHAOCOOn OO 

-8,6631 90500e-03 


O CAO /I ATAOA^ OO 

-9,598465636e-03 


O H A A A A H OOOn OH 

3,1 69221 282e-01 


A OOO/IAA/ICH n AH 

6,338429451 e-01 


OOP. 
ddO 


O H AAACA7AA« OO 

-9,169050700e-03 


H OOOAO/lOOOn OO 

-1 ,003604382e-02 


O HOC/lOOOOOn OH 

3,185488880e-01 


A A7AAA H A X A/. AH 

6,370964646e-01 


00~7 
ddi 


O A7C H OH |~AA_ OO 

-9,6751 31 500e-03 


H A A "7A/7AOAA,. OO 

-1 ,047059800e-02 


O OOHOO^OOAn OH 

3,201 287389e-01 


A HAA(7AH AA/7/. AH 

6,402561 665e-01 


OOQ 

ddo 


H AH OH OH OOOn OO 

-1 ,018101800e-02 


H A A A H H 77A An OO 

-1 ,0901 17730e-02 


O OH AAAAAT7/. AH 

3,21 6609657e-01 


A >1 oooo r~ AO r~ «. OH 

6,433205605e-01 


OOQ 
dd\3 


-1 ,0oo5v3o400e-02 


-1 ,1 3^682897e-0^ 


O OOH /l/IOCOHn AH 

J,i:o144ooo1e-01 


A /ICOOOOOCOa AH 

o,4o2ooooov3e-01 


ilOU 


H H H HA7/ AAA/. OO 

-1 ,1 18674000e-02 


H H 7/IArA7dn^ OO 

-1 ,1 74659748e-02 


O A H T7A7H (7~7z. OH 

3,2457971 57e-01 


A /I AH (7 A A A A A /. AH 

6,491 580606e-01 


OQ 1 
do\ 


1 1 £QQ77^nn.o no 
- 1 , i ooo/ / ouue-Uii 


1 oic^Q^QQQno no 
- \ ,d \ oyooooue-Uii 


Q 0^Q£/1QI^7^a C\A 

o,iioyo4yo/oe-u i 


« c;-) Q0Qc^/i/inc» m 
o,o i yiio044ue-u i 


f > r K f > 


-1,217496400e-02 


-1,256469358e-02 


3,272998929e-01 


6,545983553e-01 


233 


-1, 265891 600e-02 


-1,29611 41 77e-02 


3,285838962e-01 


6,571 66481 0e-01 


234 


-1,313420500e-02 


-1, 33479582 1e-02 


3,2981 64308e-01 


6,59631 5503e-01 


235 


-1, 359941 000e-02 


-1,372423489e-02 


3,309969604e-01 


6,619924903e-01 


236 


-1,40531 31 00e-02 


-1,408908330e-02 


3,321 248591 e-01 


6,642482877e-01 


237 


-1,449398400e-02 


-1,444163360e-02 


3,331 996202e-01 


6,663978696e-01 


238 


-1, 492061 500e-02 


-1,478104480e-02 


3,342207968e-01 


6,684402227e-01 
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Index 


32-Band Interpolation FIR 


LFE Interpolation FIR 


Perfect Reconstruction 


Non-Perfect Reconstruction 


64 x Interpolation 


128 x Interpolation 




4 COO 4 "70 JTOO oo 

-1 ,533170500e-02 


A CH AA/IA700« AO 

-1 ,51 0649733e-02 


O 0r"H07A-<0A— AH 

3,351879120e-01 


A 7A07/10A0Cn AH 

6,703743935e-01 


OA C\ 


a norm r/\rt. ao 

-1 ,572581 500e-02 


A r /IH 7AAOOH « AO 

-1,541 720331 e-02 


O O A A A A A OAO n AH 

3,361 004293e-01 


A 70HAA/1077n AH 

6,721 994877e-01 


OA 1 


a a h aoo ooaa n a o 

-1 ,610200000e-02 


A C7H A/1AA/IAn AO 

-1 ,571 240649e-02 


O OAACOAOAAn AH 

3,369580209e-01 


A 70AH /lA7AAn AH 

6,7391 46709e-01 


OA O 


a a /i rfiAonnn^ a o 

-1 ,645893800e-02 


A CAAH OOOOA« AO 

-1 ,599138230e-02 


O O 77 A AO AA /In AH 

3,377602994e-01 


A 7CCHAOOOAn AH 

6,7551 92280e-01 


OA 


-1 ,6795481 00e-02 


A AACO/I ii7H A« AO 

-1 ,62534471 6e-02 


O OOrAAfH7fln AH 

3,3850681 78e-01 


A 77AH OOOCH n AH 

6,7701 22051 e-01 


OA A 


A 7 H -4 AT )1 OAA AO 

-1 ,71 1054800e-02 


A A (IA7AT i(7A- OO 

-1 ,649795473e-02 


O OAH A7Ay1AA AH 

3,391 972482e-01 


A 700AOAAr"An AH 

6,783930659e-01 


OA £ 


-1 ,74031 2600e-02 


A A7A/I AA/I AC« AO 

-1 ,672429405e-02 


O OAOOHOAOAn AH 

3,39831 2926e-01 


A 7AAAH H TilOn AH 

6,79661 1 548e-01 


OA Pi 

dn-0 


-1 ,767225900e-02 


H AAOHAOOOH** AO 

-1,6931 90821 e-02 


O ilAdAAACOAn AH 

3,404086530e-01 


A AAOH m7CCn AH 

6,8081 58755e-01 


OA 7 


A 7 A H 7(T70nA« AO 

-1 ,791 707200e-02 


A ~7 H AAA7CACn AO 

-1 ,71 2027565e-02 


O /1AAOAAOH /In AH 

3,40929031 4e-01 


A OHOfTAAAHOn AH 

6,81 856691 8e-01 


OA Q 


H O H 00"7rr"700« OO 

-1 ,813675700e-0^ 


-1 ,7^889<i699e-0^ 


O vlHOAOOHAHn AH 

v3,41o9^191e-01 


A 00700AA7"n AH 


OA Q 


A OOOA^^OOn A o 

-1 ,833061 700e-02 


A 7/IA7/IA7Cr« AO 

-1 ,743743755e-02 


O A A 7A7A777n AH 

3,41 7979777e-01 


A OACA/irA^An AH 

6,835945249e-01 


do\J 


a A/1A7A7/1aa^. a a 

-1 ,849797400e-02 


H 7rAC/10i(0A« AO 

-1 ,756543480e-02 


O /lOH/lAHOO/ln AH 

3,421 461 284e-01 


A A A OA AO OAO n A A 

6,842908263e-01 


0£1 

do\ 


-1 ,8638301 00e-02 


A T/"" 70AA0A C « AO 

-1 ,767260395e-02 


O /lO/lOA/lAO/ln AH 

3 ,424364924e-0 1 


A 0/I07H CC/( yln AH 

6,84871 5544e-01 


OElO 

dod 


-1 ,875108700e-02 


H 77TOACOH A« AO 

-1 ,77586581 6e-02 


O /lOAAOACAAn AH 

3,426689506e-01 


A OCOOATOAOn AH 

6,853365302e-01 


Oi^O 
dOO 


-1 ,883604000e-0^ 


h 70ooonnfAn ao 

-1 JodooWooe-Qd 


O /lOOdO/HA^n AH 


A OCCOCOAOOn AH 


OElA 


A OOAOH con^n ao 

-1 ,88931 5300e-02 


A 7f)AAAA7CA« AO 

-1 ,786663756e-02 


O /10ACA70HOn AH 

3,42959731 8e-01 


A OCAHOAOOHn AH 

6,8591 80331 e-01 


0££ 
dOO 


A OOOH CA /"> r\ /"\ _ ao 

-1 ,892151600e-02 


H 7000A00A^« AO 

-1 ,788828894e-02 


O /10AH7A0rr0n AH 

3,4301 79358e-01 


A OAAO/l/l/lHAn AH 

6,86034441 0e-01 


dOO 


1 ,8921 51 600e-02 


h 7nnnonnA/ ^ ao 

1 ,788828894e-02 


*iooH7AorrOn oh 

3,4301 79358e-01 


A OAAO/1 A A A An AH 

6,86034441 0e-01 


do I 


A OOnOH COrtA^ A o 

1 ,88931 5300e-02 


A 7AAAAA7C A^ AO 

1 ,786663756e-02 


O j100r"A70HOn OH 

3,42959731 8e-01 


A OCAHOAOOHn AH 

6,8591 80331 e-01 




1 ,883604000e-02 


H TAAOAAAAA^ AO 

1 ,782339066e-02 


O iIAO^AiIH A/In OH 

3,4284341 34e-01 


A ATAArAAAA AH 

6,856853962e-01 


Oi^Q 


A 07CH A07AAn A O 

1 ,8751 08700e-02 


A 77COACOH An OO 

1 ,77586581 6e-02 


O /lAAAOACAAn AH 

3,426689506e-01 


A OCOOATOAOn AH 

6,853365302e-01 




H OAOOOAHAAn A O 

1 ,8638301 00e-02 


A 7A7AAAOACn AO 

1 ,767260395e-02 


O /lA/IOAylAO/ln AH 

3,424364924e-01 


A 0/107HrrC/1/1n AH 

6,84871 5544e-01 


OC1 


A O / A7A7^AA^ AO 

1 ,849797400e-02 


A 7CAC^Oi10An OO 

1 ,756543480e-02 


O /1AH/lAHOO/1n OH 

3,421461 284e-01 


A O/IOAAOOAOn AH 

6,842908263e-01 


dod 


H OOOAA'I 7 A A f\r\ 

1 ,833061 700e-02 


A 7 /I 07 ^ A7CC^ OO 

1 ,743743755e-02 


O A A 7A7A777n AH 

3,417979777e-01 


a oocA/irro/iAn ah 

6,835945249e-01 


OCQ 


h OA oa7C7aa,^ a o 

1 ,813675700e-02 


A ^OOOAOAAAn OO 

1 ,728892699e-02 


O /IHOAOOHAHn AH 

3,413922191e-01 


A 00700007An AH 

6,827830076e-01 


Ofi/1 
£iD4 


H 7A70AAn A O 

1 ,791707200e-02 


A ~7 A OAA7CATn OO 

1 ,712027565e-02 


O /lOAOAAOH /In OH 

3,40929031 4e-01 


A OHOrTAAAHOn AH 

6,81 856691 8e-01 


OC£ 


H 7A70ncnAA« A O 

1 ,767225900e-02 


A AAOH AAOOH n AO 

1 ,6931 90821 e-02 


O /lO/IAOACOOn OH 

3,404086530e-01 


A OAOI T07CCn AH 

6,8081 58755e-01 


*iOO 


H 7dAOH OOAAn OO 

1 ,74031 2600e-02 


A A7Ad AAd AC^ AO 

1 ,672429405e-02 


O OAOOHOAOAn AH 

3,39831 2926e-01 


A 7AAAHHC/10n AH 

6,79661 1548e-01 


OC7 

do/ 


A ~~7 A H AC /I OAA^ A o 

1 ,71 1054800e-02 


H A/IA7AC/I70n OO 

L 1 ,649795473e-02 


O OAHA70/iOAn OH 

3,391 972482e-01 


A 70OAOOACAn AH 

6,783930659e-01 




H r"7Ar <1 OH A A oo 

1 ,6795481 00e-02 


A AACA ^ i17H An OO 

1 ,62534471 6e-02 


O OOCAAOH70n AH 

3,3850681 78e-01 


A 77AH OOACH n AH 

6,7701 22051 e-01 




1 ,645893800e-02 


A CAAH OAOOOn OO 

1 ,5991 38230e-02 


O 077AA0AA/1n AH 

3,377602994e-01 


A 7CCHAAOOAn AH 

6,7551 92280e-01 


07fl 
d/U 


h ahaoaaaaa« ao 

1 ,610200000e-02 


A r"~7W A AA / A^ AO 

1 ,571240649e-02 


O OAACOAOOAn AH 

3,369580209e-01 


A 7AAH A A 7 A A n AH 

6,7391 46709e-01 


071 


H C70C0H PAAn OO 

1 ,572581 500e-02 


A CiH 7AAAH n OO 

1 ,541 720331 e-02 


O OAHAA/IOAOn OH 

3,361 004293e-01 


A 70HAA/1077n AH 

6,721 994877e-01 


070 
did 


A COOH 7 A C A A A o 

1 ,5331 70500e-02 


A CHOAvfA^OOn AO 

1 ,510649733e-02 


O OCHA7AHAAn AH 

3,351 8791 20e-01 


A 7AA7/10AOCn AH 

6,703743935e-01 


070 
d/O 


H /I AAAAH PAAn AO 

1 ,492061 500e-02 


A A ~7 Ci A f^t A dOA^ AO 

1 ,4781 04480e-02 


O 0/100A7AAOn AH 

3,342207968e-01 


A AO/1/1A0007n AH 

6,684402227e-01 


0~7A 
dl 4 


H /I /I AAAA /I AA^ OO 

1 ,449398400e-02 


A A A A H AOOAA— . OO 

1 ,4441 63360e-02 


O OOHAAAOOAn OH 

3,331 996202e-01 


O AA0A70AAAn AH 

6,663978696e-01 


d/O 


H /lACO^HAAn OO 

1 ,40531 31 00e-02 


A A AOAAAAAA^ OO 

1 ,408908330e-02 


O OAHO/iACAHn AH 

3,321 248591 e-01 


A A/10/10A077n AH 

6,642482877e-01 


d/O 


H OCAA>( H AAAn OO 

1 ,359941 000e-02 


A nOylAA^OAn AO 

1 ,372423489e-02 


O OOAAAAAA/ln AH 

3,309969604e-01 


A AHAAO/IAAOn AH 

6,61 9924903e-01 


0~7~7 

dl I 


a ah o /i onrA^^ a o 

1 ,313420500e-02 


A OOOAmOH n AO 

1 ,334795821 e-02 


O OAOHA/IOOOn OH 

3,2981 64308e-01 


A CAAH CCAAn AH 

6,59631 5503e-01 


07Q 
£1/0 


A OACOAH A A A AO 

1 ,265891 600e-02 


A OAAH AAA 77 n AO 

1 ,2961 14177e-02 


O OOCOOOAAOn AH 

3,285838962e-01 


A CT7HAA/10HAn AH 

6,571 66481 0e-01 


07Q 


A OH 7j)AA^AA« OO 

1 ,217496400e-02 


H ACAil AAACOn AO 

1 ,256469358e-02 


O A7AAAAAAAn AH 

3,272998929e-01 


a C/icrAoorrcrOn ah 

6,545983553e-01 




A A A0077CAAn OO 

1 ,168377500e-02 


A OHCACOOOAn AO 

1 ,215953380e-02 


O OCAA A AC7T^ AH 

3,259649575e-01 


A CTHAAOrT/1/IAn AH 

6,51 9285440e-01 


OQ 1 

do\ 


A A A Clf*^ A f\f\f\ — . OO 

1 ,1 18674000e-02 


A H7*1ACA7*10n AO 

1 ,174659748e-02 


O A/1C7A7H C7n AH 

3,2457971 57e-01 


A /lAHCOAAAAn AH 

6,491 580606e-01 


OQO 

dod 


1 ,068536400e-02 


A HOOAOOOA7n AO 

1 ,132682897e-02 


O OOH/l/1Ar"OHn OH 

3,231448531e-01 


A /IAOOOOOr"On AH 

6,462883353e-01 


OQO 
*100 


A f\A CtA F\A AAAn OO 

1,018101800e-02 


A AAAH H 77AA m OO 

1 ,0901 1 7730e-02 


O AH AAAAAC7n AH 

3,216609657e-01 


A il AAAACAACn AH 

6,433205605e-01 


OQvl 
dOH- 


O A7CH OH PAAn OO 

9,6751 31 500e-03 


A A^7ACAAAAn AO 

1 ,047059800e-02 


O OOHOA700An AH 

3,201 287389e-01 


A /lAOrTAHAACn AH 

6,402561 665e-01 


OQ£ 


9,169050700e-03 


A AAOAA^OOOn AO 

1 ,003604382e-02 


O HOrT/IOOOOOn OH 

3,185488880e-01 


A A7AAA/1A/1An AH 

6,370964646e-01 


doO 


O AAOH OOCOO« OO 

8,6631 90500e-03 


A CAOylACAOAn OO 

9,598465636e-03 


O A AAOOH OOAn OH 

3,1 69221 282e-01 


A OOO/IAA/ICHn AH 

6,338429451 e-01 


do I 


O H AOOOOOOO— OO 

8,160839200e-03 


A HrOAH HCOOn OO 

9,15881 1532e-03 


O Hf - A/IAH7j10n AH 

3,1 52491 748e-01 


A OA/1A7AOO/1n AH 

6,304970384e-01 


OQQ 


—j AAAA^CCAAn OO 

7,662045500e-03 


O 71 OAH AOT1 « AO 

8,71 801 9351 e-03 


O H OrrOAAAA7n AH 

3,135308027e-01 


A 07AAAAA/10n AH 

6,270602942e-01 


OQQ 

do\5 


—j j A7AH ^ CAA« OO 

7,1 6791 4500e-03 


A A77A H AAAOn OO 

8,27701 6692e-03 


O H H 7A7AA70n OH 

3,1 17676973e-01 


A OOCTO/1H /lOAn AH 

6,235341 430e-01 


OOC\ 


O OOHOOOOOO^ OO 

6,681 936000e-03 


-7 AAA7H AAd / n AO 

7,83671 9044e-03 


O AAAAAOH OOn OH 

3,0996081 23e-01 


A HAAOOOrrOOn AH 

6,1 99202538e-01 


OQ1 

d\5 1 


O OAOAACH 00« OO 

6,2020051 00e-03 


7 OOOAOOOOO— OO 

7,398022339e-03 


O OOHHAAAOjIn OH 

3,081 1 08034e-01 


A HAOAOAAr"/(n AH 

6,1 62202954e-01 


OQO 

d\5d 


C 70A/1700AAn AO 

5,730478300e-03 


A AAH OA7AAAo AO 

0,961 o07o09e-03 


O AAOH OCO/l7n AH 


6,1 <i4o5o1 77e-01 


OQO 


5,268542300e-03 


A irAOAAAAA7^ AO 

6,528933067e-03 


O O/IOO/IAAA/ln AH 

3,042849004e-01 


A AOCTAO/IOAAn AH 

6,085684896e-01 


OQ/1 

^y4 


/i qi 7i oonnrid no 


« -\ C\C\00.oaA A a. no 
Q, I UU<lOO044e-UO 


noo-inc7^/iA ni 


d,U4d^u 1 00 / e~u 1 


C vj 


4,377291 000e-03 


5,67651 8660e-03 


3,002967536e-01 


6,0059231 52e-01 


296 


3,949734800e-03 


5,258570891 e-03 


2,982441 187e-01 


5,964869261 e-01 


297 


3,535329200e-03 


4,8471 40983e-03 


2,961 534858e-01 


5,923057795e-01 


298 


3,134797500e-03 


4,442950245e-03 


2,940258980e-01 


5,880505443e-01 


299 


2,748797700e-03 


4,046686925e-03 


2,91 8621 898e-01 


5,837231 875e-01 


300 


2,377899500e-03 


3,659002949e-03 


2,896633744e-01 


5,793255568e-01 


301 


2,02261 4600e-03 


3,28051 461 8e-03 


2,874303460e-01 


5,748594403e-01 
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64 x Interpolation 
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o\Jd 


1 ,683383000e-03 


o-i-ioOHOOO« 00 

2,91 1801683e-03 


2,851 640880e-01 


n ^AOAAAOOAn AH 

5,703269839e-01 


oUo 


A OAACH O A A A A o 

1 ,36051 9200e-03 


O CCO^AOOdOn OO 

2,553403843e-03 


nooArrnoA^ ah 

2,828655839e-01 


C AC7AAAH AH ^- AH 

5,6572991 61 e-01 


oU4 


1 ,054538000e-03 


O OACOAOOO/ln OO 

2,205822384e-03 


O OAirorAA7A« AH 

2,805356979e-01 


C AH A7AAAA ^ - OH 

5,610702634e-01 


oUO 


—J /TOAOOAAAn f\ A 

7,653038000e-04 


1 ,86951761 1e-03 


O 70H7CC0A0n OH 

2,781 755328e-01 


C rAA /I AAAAA/. AH 

5,563499928e-01 


one 
oUo 


a a o o a o o a a a — . i*\ ji 

4,930996000e-04 


H CA Af\f\Ci(\~}(\r^ OO 

1 ,544908970e-03 


O 7C70AA70An OH 

2,757860720e-01 


C CHC^HAH A C/- AH 

5,515710115e-01 


oU/ 


A O O O A a a r~ A A — . f\ A 

2,380696500e-04 


A OOOO^^WHO— . OO 

1 ,2323741 10e-03 


O 7A OA AAAAA« OH 

2,733682692e-01 


r~ /( A7ATA /I AA- AH 

5,467353463e-01 


QflQ 

oUo 


r\ r\~7r\r\r\n yi A A n A~7 

2,272228400e-07 


O OAAjIAi) H TA« O A 

9,3224941 52e-04 


O 7AAAA A O ~7~7 ^ OH 

2,709231 377e-01 


C /I A A /I C AAA An AH 

5,418450832e-01 


QflQ 

ouy 


O OO A Clf\ A "7 A A O A 

-2,204804700e-04 


< /IflAOACH H n O A 

6,44828251 1 e-04 


O AO^CH AAAA« OH 

2,68451 6609e-01 


r- AAAAAA/IOAn AH 

5,369022489e-01 


ol U 


-4,24 1 331 700e-04 


^oooorooo** f\ a 

3,703625989e-04 


O ATACiIAiH r« OH 

2,65954941 5e-01 


C OHAAOOHAHn AH 

5,31 90881 01 e-01 


Q -M 


-6,1 0907bb00e-04 


1 ,090620208e-04 


O AO^OOHAOCn AH 

2,bo4JJ90ooe-01 


C AAOAA "7 AO A n A H 

5,268bD7936e-01 


ol d 


-7,810380900e-04 


-1 ,38905641 5e-04 


O AAAOAA7AA^ AH 

2,608896792e-01 


C OH 77AAnriTn AH 

5,217782855e-01 


ol o 


A A/lAAA/IAAAn C\ A 

-9,349224800e-04 


O 70^H OAOH On O A 

-3,7341 3931 8e-04 


O rOOAOOAAyln AH 

2,583232224e-01 


C H AA/CA7H An AH 

5,1 6645371 9e-01 


Q 1 A 

ol 4 


-1 ,072608600e-03 


r" A/|A77A(|AH _ O/l 

-5,943773431 e-04 


/~ , /~' - 7orrrr - 7/^o^-. ah 

2,557355762e-01 


C H H A~? f\ A OAA^ AH 

5,1 14701390e-01 


Q 1 £ 
ol 


H HA/l^O^^OOn A O 

-1 ,194737700e-03 


O f\H 7CAi(COOn O A 

-8,01 7504588e-04 


O CAH A7A H OO^- AH 

2,531 2781 33e-01 


c AAAr/icroOn oh 

5,062545538e-01 


Ol D 


^ o /^ -( /" o o >m /^ « ao 

-1 ,301 b23400e-0o 


n Air[ro707nOn f\A 

-9,9oo^7J708e-04 


A CACAAOD7An AH 

2,50500887be-01 


C AHAAA^AAAn AH 

5,010007b20e-01 


Ol / 


-i O A /I A H "70 A A OA 

-1 ,39401 7000e-03 


H H 7r7MA70>l« OO 

-1 ,1 75740734e-03 


O >I7ACCAAH ~~7 ~ AH 

2,47855901 7e-01 


,1 O IT ~7 H AAAAAn AH 

4,9571 08200e-01 


Q1 Q 
Ol O 


-1 ,478566700e-03 


H A/(AAAOAn OO 

-1 ,342460280e-03 


A /(CH AOOOOOn AH 

2,451 938838e-01 


a AAAAA7r/IAn OH 

4,903867543e-01 


Q1Q 

oi y 


-1 ,541 196600e-03 


H ^Ar7AOf^7« OO 

-1 ,495792647e-03 


O d ACH CA7AA^ AH 

2,4251 58769e-01 


A ACAAA7H ACn AH 

4,850307405e-01 


o^U 


1 ,592423900e-03 


H ^oirn7n AOA^ OO 

1 ,635878929e-03 


O AAOAAAA/I An OH 

2,398228943e-01 


*1 7AA/I /lOACAn AH 

4,796448052e-01 


O^ll 


1 ,632849400e-03 


H 7f , OOA^OOH n OO 

1 ,762894331 e-03 


O A7H A CA^AAn AH 

2,371 1 59792e-01 


A 7/1AAAA/ICH n AH 

4,742309451 e-01 


odd 


h a it a f ,h o n a a a o 

1 ,654649800e-03 


H 077A^AH Af\« OO 

1 ,8770461 90e-03 


O A/I AAAH TAAn AH 

2,343961 596e-01 


A O ~7 A HOACAn AH 

4,68791 3656e-01 


o^o 


A /* 1 7AOn7^ A A A O 

1 ,670887400e-03 


H A70C70r , AH _ OO 

1 ,978572691 e-03 


O AH AAvl ^700« OH 

2,316644788e-01 


*1 AAAAOAAAAn AH 

4,633280039e-01 


o^4 


h a ~7 a c /i o a a a a a 

1 ,676540900e-03 


O AA77 yl O ~7~7 ^ _ OO 

2,067740774e-03 


O AOAAH AA/IOn AH 

2,28921 9648e-01 


*1 C70/IAA^ AH n AH 

4,578429461 e-01 


QO£ 
O*l0 


1 ,672691 700e-03 


O A A A CI A / A7A« OO 

2, 1 44844970e-03 


O AAHAAAHAAn AH 

2,261 6961 60e-01 


*1 CAAOOOOOH n AH 

4,523383081 e-01 


odo 


A r>Cn7CH >1 A A AO 

1 ,659751 400e-03 


O AH AAAmOTn OO 

2,210205887e-03 


O AO/IAO^AAAn AH 

2,234084606e-01 


A /lAOHAAA^On AH 

4,4681 60272e-01 


Q07 

odl 


H AOOOAO./100n A A 

1 ,638393400e-03 


O AAd H A7DOH « OO 

2,2641 67881 e-03 


O AAAAACH H An AH 

2,2063951 19e-01 


*1 A A A7A H AAA/- AH 

4,41 2781 000e-01 


QOQ 


1 ,609496400e-03 


O OA7AA7H rr ^ OO 

2,3070971 95e-03 


O A 70AA7AA-I ^ AH 

2,1 78637981 e-01 


*1 AC7AAA7A/ln AH 

4,357266724e-01 


QOQ 

o^y 


H r"70"7H CAAAn A o 

1 ,57371 5600e-03 


O OOOO AA7AO _ OO 

2,339380793e-03 


O A CAOAA7AA^ AH 

2,150822729e-01 


A OAHAOAAHAn AH 

4,301 63621 9e-01 


ooU 


A ADA^ AA« A O 

1 ,531686400e-03 


O OAH ^ AOCAAn OO 

2,361 423569e-03 


O H AAATA/ACn AH 

2,122959495e-01 


A A/ICAH AA/(Cn AH 

4,24591 0645e-01 


ool 


A /I O /IH CAiCAAn A A 

1 ,4841 59500e-03 


O 070Aj(7H 7f)n OO 

2,3736471 78e-03 


O AACACO H H A ^ AH 

2,09505841 1e-01 


*1 A AAH A7AAAn AH 

4,190107882e-01 


QOO 

oo<i 


H /OH ~7H 7C AA^ A o 

1,431 71 7500e-03 


2,376487479e-03 


O f~\r^~7 A A A7H OH 

2,0671 2871 8e-01 


*1 H A/(A/l07AOn AH 

4,134248793e-01 


QOQ 

ooo 


H OTCATOOAAn A o 

1 ,375058300e-03 


O A7AOA/IAAOn OO 

2,370394068e-03 


O OOAH OACi(7n AH 

2,0391 80547e-01 


*1 A7AACAH CO/- AH 

4,0783521 53e-01 


QQ/1 

oo4 


H OH <fH OAf\f\« A o 

1,31 481 3200e-03 


ocirnorAOA^ 00 

2,355825622e-03 


O OH HOOOOOAn OH 

2,01 1223286e-01 


*1 AAA/l07AAAn AH 

4,022437930e-01 


QQ£ 
OOO 


1 ,251459700e-03 


O OOOATH AAH n OO 

2,333251 061 e-03 


A AOOOAAH^f - — . OH 

1 ,9832661 75e-01 


O AAACA/I AAAn AH 

3,966524303e-01 


QQC 


A H OOH A7AAA« OO 

1,182107000e-03 


O OOOH ylCOAAn OO 

2,3031 45360e-03 


A ACCAHAHAAn AH 

1,95531 91 96e-01 


O AH AAAAA/(7n AH 

3,910630047e-01 


QQ7 

00/ 


H WW /lAA^AArtn OO 

1,1 14606900e-03 


O AACAAAO H (T^ OO 

2,26598931 5e-03 


A AA7AA A A /I A^ AH 

1 ,927391 142e-01 


O OC/ ~7 ^ A CO C n AH 

3,854774535e-01 


QQQ 
OOO 


H A^CAOCCAAn OO 

1 ,045985500e-03 


O AAAAAAC A A « OO 

2,22226651 4e-03 


A OAA/AH AAOn AH 

1 ,899491 698e-01 


O 7AOA7rA/)An AH 

3,798975349e-01 


QQQ 

ooy 


9,7671 58300e-04 


O H7A/(AAH77n OO 

2,1 724621 77e-03 


A O^HAOACAAn AH 

1 ,871629506e-01 


O ~7 /I OACAAAAn AH 

3,743250966e-01 


Q/i n 
o4U 


o ATirrrn AAn o*i 

9,0725551 00e-04 


O A A 7AA H OOAn OO 

2,1 17061289e-03 


A OjIOAHOOAO— . OH 

1 ,84381 3360e-01 


O AO^AHAAAAn AH 

3,68761 9269e-01 


Q/1 1 

o41 


O OOOOH oooo« O *1 

8,38091 8900e-04 


O ACAC<(7AAOn OO 

2,056547208e-03 


A OH AACA/AAn AH 

1 ,816052496e-01 


O AAAAA7A/ A /- AH 

3,632097244e-01 




—j AArrA"?rAA^ o*i 

7,695597500e-04 


A AAHOAACA/(n OO 

1 ,991399564e-03 


A 700ACC/(HAn AH 

1 ,78835541 0e-01 


O C7A7AAAA A _ H 

3,576703668e-01 


Q/1 Q 
o4o 


-7 OO W OO^OOO « O *1 

7,021 067600e-04 


A AAAAAArH7n OO 

1 ,92209251 7e-03 


A ^AA^AAOAAn AH 

1 ,760730892e-01 


O CAH /IT/I AAAn AH 

3,521 454632e-01 


Q/1 /I 
o44 


O Or* OO A A 7AA « O *1 

6,360244700e-04 


A AilAAA^rAA- OO 

1 ,849094522e-03 


A 7AOH AAAAAn AH 

1 ,7331 86990e-01 


O /lAAOAAOAOn AH 

3,466366828e-01 


Q/1 £ 
o40 


C ~7W ~700H r\f\f\ ^ f\ A 

5,71 7321 000e-04 


A 77AAArOA/(- OO 

1 ,772865304e-03 


A 7AnAAAi(A- OH 

1 ,705732346e-01 


O /(H H /inCAAn AH 

3,41 1457539e-01 


Q/1 C 

o4d 


C OOWOO^COO^ O A 

5,091 327500e-04 


A OOOOCCOOO^ OO 

1 ,693855622e-03 


A A7AA7CAAAn AH 

1 ,678375006e-01 


O ACA7/I AOTAn AH 

3,356742859e-01 


Q/1 7 

o4/ 


A /lOO^ AOAAAn O A 

4,488403900e-04 


A A ACACCAA^ OO 

1 ,612505526e-03 


A ACHHOO^HAn AH 

1 ,651 1 22719e-01 


O AOOAOOAOAn AH 

3,302238286e-01 


Q/1 Q 

o4o 


O AAAAA H 000« O *1 

3,909221 300e-04 


A CAAA^OH AOn OO 

1 ,5292431 93e-03 


A AAOAAACOAn AH 

1 ,623983532e-01 


O A A 7AAAAH An AH 

3,24796021 0e-01 


Q/1 Q 

o4y 


3,353960600e-04 


A A A A A O A OO A — OO 

1 ,444484224e-03 


A CAAAATH A/In AH 

1 ,5969651 94e-01 


O HAAAAOOOHn AH 

3,1 93923831 e-01 


oOU 


O O^OOOfOOO— . O jl 

2,678985000e-04 


A ACOAOAAH 0~ OO 

1 ,35863001 9e-03 


A C7AA7TH (T A ~ OH 

1 ,5700751 54e-01 


O A A C~\ A A / ACA^ AH 

3,140144050e-01 


QC1 

ool 


O OOWHOOCOO^ f\ A 

2,231 1 29500e-04 


H A7AAA7/I AO« OO 

1 ,272067428e-03 


A rdAAAAOA/ln AH 

1 ,543320864e-01 


O AOAAOIT H "70 _ OH 

3,0866351 72e-01 


OOd 


H "7"7AAA H 000« O A 

1 ,772621 900e-04 


A H OCH A7Ai(On OO 

1,185167348e-03 


A CA A7AAAAA ^ AH 

1 ,516709626e-01 


O OOO/lHAAACn AH 

3,03341 2695e-01 


Qi^Q 
oOo 


H OH 4 C/IACAAn 0*1 

1 ,31 1540500e-04 


A AAOAO^AH ^ _ OO 

L 1 ,09828461 4e-03 


A A AAA A CI A A A n OH 

1 ,4902481 44e-01 


A AOA/lAAAAAn AH 

2,980490029e-01 


Qi^/1 

o04 


A oroH if O *1 00 — . O A 

1 ,052143400e-04 


A OH A ~7 H C H r\C> ^ OO 

1 ,01 1756598e-03 


A /I AAA/IA7AA. AH 

1 ,463943720e-01 


A AA^AOH HAH AH 

2,927881 181 e-01 


Q C C 

oOO 


OAOOOOOAAr, A£T 

b,898332300e-05 


n ArAAO/IAAAn A>1 

9,259Uo499be-04 


H /l070A0A10n AH 

1 ,4o7802o1oe-01 


2,8755992b5e-01 


oOb 


3,623958500e-05 


O /I H AA7H AC7« f\ A 

8,41 0271 257e-04 


A A A A OOH A~7rr^ AH 

1 ,41 1831 975e-01 


A AAAAC7AAAn AH 

2,823657692e-01 


Q£7 

oO/ 


0,0 I iiODOUUUe-UD 


7 ^7/1 -1 nQ/i77d r»/i 
/ ,0 / 4 1 uy4/ / e~u4 


-I QQ^nQ7C1QA H1 

I ,ooOUo / I oe-u I 


770H^Q07£q Hi 
d, / / duOvd 1 0G - U I 


OOO 


-2,010055900e-05 


6,7531 97522e-04 


1,360425949e-01 


2,720846236e-01 


359 


-4,214289700e-05 


5,949990009e-04 


1,335003078e-01 


2,670000792e-01 


360 


-6,1 57321 800e-05 


5,166754709e-04 


1, 30977481 6e-01 


2,619544268e-01 


361 


-7,793692700e-05 


4,40556381 4e-04 


1,284746826e-01 


2,569487989e-01 


362 


-9,130172200e-05 


3,668301 21 5e-04 


1,259924471 e-01 


2,519843280e-01 


363 


-1,018237000e-04 


2,956659591 e-04 


1, 23531 3043e-01 


2,470620573e-01 


364 


-1, 0974351 00e-04 


2,2721 42592e-04 


1,21 091 7681 e-01 


2,421 830446e-01 
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32-Band Interpolation FIR 
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Perfect Reconstruction 


Non-Perfect Reconstruction 


64 x Interpolation 


128 x Interpolation 


ODD 


h h coco/i / oo n r\ a 

-1 ,152534400e-04 


4 AH AAAA7AC^ o A 

1 ,616069785e-04 


1 ,186743453e-01 


2,3734821 38e-01 


ODD 


H H ocoooooOn O/ 

-1 ,185602000e-04 


O OAC7CCA((7n OC 

9,895755647e-05 


H H AA7A>IA77« OH 

1 ,162794977e-01 


A OOCCOCHO^n OH 

2,3255851 87e-01 


OO/ 


-1 ,200453700e-04 


O AAAH A A A A A n OC 

3,9361 82839e-05 


H H AAA7AA/0« OH 

1 ,139076948e-01 


O A70H A C~\t~\~?~? ^ OH 

2,2781 49277e-01 


oDo 


H OOTOT'T'OOOn O / 

-1 ,237377900e-04 


H ~~7 4 AH AOA-I An OC 

-1,71 01 9801 2e-05 


H 4 4 CCAAA07« OH 

1 ,1 1 5593687e-01 


O OA H 4 OOOOOn OH 

2,231 182903e-01 


ooy 


H ooooooooon o / 

-1 ,226936800e-04 


-7 OOT"OCOOOOn OC 

-7,037253090e-05 


H OOOOvlOOOC— OH 

1 ,092349365e-01 


O HO/AA/AAA- OH 

2,184694260e-01 


o/U 


-1 ,20201 8700e-04 


H OO/OylOHHO — A H 

-1 ,2040491 13e-04 


H AAAA/7AAA. OH 

1 ,069347933e-01 


O HOOOOHC/1C— OH 

2,1 38691 545e-01 


Q71 
Of 1 


H H A IT A A H 7AA. o / 

-1 ,1 65901 700e-04 


H A ~7 H AAA7AA« A A 

-1 ,671699720e-04 


H A/ACAAAA/n OH 

1 ,046593264e-01 


O AAAH OOOOOn OH 

2,0931 82206e-01 


Old. 


-1 ,1 19841200e-04 


A 4 AAT07C77« A H 

-2,1 06537577e-04 


H AA/AOAAOA« OH 

1 ,024089083e-01 


O A/OH 700/iTn OH 

2,0481 73845e-01 


o/o 


H flf'^fH /I ~7 A A n /-A A 

-1 ,06481 4700e-04 


A CAOrA7AAC« A A 

-2,508567995e-04 


H OOHOOOOOO** OH 

1 ,001838669e-01 


O AAAA7AH AA^-. OH 

2,0036731 66e-01 


o/4 


-1 ,002681 400e-04 


A A77AA A AAOn O / 

-^,8779o460oe-04 


O 7HO/IC07>IOn OO 

9,79845o748e-02 


H OCOAOAC77n OH 

1 ,959D8D577e-01 


Q7£ 


-9,341 758200e-05 


A AH /1AHHAA~7n A A 

-3,21491 1267e-04 


A rOH / A/AOOn OO 
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9,51 6004470e-03 


A Pl1 
4ol 


"7/ roo^AH r*AAn oa 

-7,533601 500e-06 


2,885844333e-05 


>J rH H H CAO/ITn OO 

4,51 1 1 59845e-03 


A OOOOOH OAOn OO 

9,022301 060e-03 


ACO 


-7 A7AAAA"7AA^ A /-> 

-7,678809700e-06 


2,718161704e-05 


A A7d ^ H 7H OH «, OO 

4,27441 71 81 e-03 


O J7 A AOH C70A^. OO 

8,54881 5730e-03 


A CO 
40o 


-7 /-i j OOIAAA^ A /-> 

-7,614387900e-06 


O (7(7 /I CAAA7A« /-> (~ 

2,554569073e-05 


A O/17/lAAAOO^ OO 

4,047499038e-03 


O OOdAOAOOOn OO 

8,094980380e-03 


4o4 


-6,046428700e-06 


2,395598858e-05 


O OOOOOH ooo« OO 

3,830091 329e-03 


—7 AAA^ ACOAAn OO 

7,6601 65890e-03 


vl CC 
4t>0 


-5,696789700e-06 


O A^ H 7AH AAOn f\C 

2,241 701 623e-05 


O AOH AA7A77n OO 

3,621 967277e-03 


~7 OvlOAHAHAAn OO 

7,24391 91 90e-03 


40D 


rr oh o^^rrooOn oa 

-5,210775600e-06 


O AAOACA A / H . /-> (— 

2,093250441 e-05 


O /IAA77AAOAn OO 

3,422776936e-03 


A O A CrOAAAA/. OO 

6,845539900e-03 


4t>/ 


/I A H 7AAAAAA^ A /-> 

-4,617880200e-06 


1 ,950545993e-05 


O OA AA7AC7T ^ OO 

3,232272575e-03 


A ylAdCOAH H An OO 

6,4645321 1 0e-03 


A CQ 


-3,953604500e-06 


H OH AAAAOOH n f\C 

1 ,81 3820381 e-05 


O ACAH PAAH H ~ OO 

3,05015291 1e-03 


A H AAAAOACAn OO 

6,100293250e-03 


4oy 


O ACOAOAAAAn r\r* 

-3,253336000e-06 


H AOOAjI <-\ -7-7 /-> ^ /-> (— 

1 ,683242772e-05 


O A7AH AAH AAn OO 

2,8761 90469e-03 


C ICAOAAOAA/. OO 

5,752369300e-03 


/I 7n 

4/U 


-2,555774300e-06 


h r~ r~ 000 0^ — r~ 

1 ,558924305e-05 


O "7 H AM AO A7 A OO 

2,710093278e-03 


r* HAAH7dAAAn OO 

5,4201 74920e-03 


A 71 

4/1 


H AAilOAAAAAn /-\ /-> 

-1 ,904890000e-06 


H <AAAH AA/I- »*i r~ 

1 ,440921 824e-05 


rc H rA7A H o« OO 

2,551 567042e-03 


C HOOHOOAHAn OO 

5,1 031 2291 Oe-03 


/I 70 

4/2 


j AAA777AAA«» /~\ /-> 

-1 ,306777300e-06 


H AAAA^AAAAn /-> (~ 

1 ,329243969e-05 


O H AAHOO H H A^ OO 

2,40043341 2e-03 


A OAAOCACOAn OO 

4,800856580e-03 


/I 70 

4/o 


-2,448299500e-07 


h AAonrrA7A^ oc 

L 1 ,223855270e-05 


O OCAArAA7A^ OO 

2,256359672e-03 


/I CH A7HAAAAn OO 

4,51 271 0030e-03 


/I 7/1 

4/4 


-1 ,007547900e-07 


1 ,124680875e-05 


O H H AH CH 07C- OO 

2,1 19151875e-03 


A OOOOA/lAAAn OO 

4,238294900e-03 


/I 7C 

4/0 


-1 ,643020800e-08 


H OOHOHHOOO^ f\C 

1 ,03161 1009e-05 


H AOOAA^H AA« OO 

1 ,9886041 99e-03 


O A77AAA A DA^- OO 

3,977200480e-03 


A IP. 
4/D 


-3,71 5261 200e-09 


O >1 < CACAAA7« OO 

9,445050637e-06 


H OA /I A77CAAn OO 

1 ,864377526e-03 


O 7AA7 A 7H HAn OO 

3,7287471 40e-03 


/I 77 

4/ / 


/-i AAAOrCAAA« OA 

-6,820855300e-09 


O AOH AAylH AA« OO 

8,631 9641 92e-06 


H 7/1 AAA7CAA™ OO 

1 ,746327500e-03 


O d AAA i(7CrAn OO 

3,492647550e-03 


/I 7Q 

4/o 


3,377455500e-09 


—j A7/(AAAOAAn OO 

7,874960829e-06 


H AA/IAH H TH An OO 

1 ,63421 1512e-03 


O AAOd H ACH An OO 

3,26841 651 Oe-03 


A 7Q 

4/y 


O AAOAAA^ AAn f\f\ 

2,668096400e-09 


—j H 7H AA^AOAn OO 

7,171964626e-06 


H CA7AAA7A A — . OO 

1 ,527829794e-03 


O ACCATAAAAn OO 

3,055653300e-03 


4oU 


A A -4 7AOAAA« (*\0 

1 ,61 1726200e-09 


O CAA7rCAAA^ OO 

6,520755960e-06 


H vIOAOOOOOO— . A A 

1 ,426893868e-03 


O Of~O^OHOOOn OO 

2,853781 920e-03 


A Q 1 

401 


6,821 591 800e-10 


C A1 OAA/IO/IOft OO 

5,91 o994^4oe-0o 


H OOH OCOAH /In OO 

1 ,331 25891 4e-03 


A AAACH AO/1 An OO 

2,dd251 2240e-03 


/I QO 

4o2 


-4,147967800e-08 


C OA/tATACAAn OO 

5,364252502e-06 


H Ad AA7AOCH _ OO 

1 ,240676851 e-03 


O /iOH A/IOOH An AO 

2,481 34881 Oe-03 


vl QO 
4oo 


- 1 ,u<io 1 4yuuue-u / 


4,o04U0U0oUe-UO 


-1 iccnooc7Qc 1 no 
1 , 1 oou^oo/ye-uo 


o-inr»/iocnno no 
c.,o \ uu4iiouue-uo 


tot 


-3,167104300e-07 


4,385879038e-06 


1,073930296e-03 


2,147856400e-03 


485 


-8,4544801 00e-07 


3,957220997e-06 


9,974770946e-04 


1 ,994950230e-03 


486 


-1, 560941 800e-06 


3,565570978e-06 


9,251 192096e-04 


1 ,850234690e-03 


487 


-8,920065800e-07 


3,208459020e-06 


8,5701 10658e-04 


1,71 401 8640e-03 


488 


-9,822894700e-07 


2,883470643e-06 


7,92851 671 6e-04 


1 ,585700080e-03 


489 


-1,124542400e-06 


2,588257530e-06 


7,3241 48901 e-04 


1 ,464826870e-03 


490 


-1,144180200e-06 


2,320550948e-06 


6,75551 5351 e-04 


1,351 100280e-03 
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Index 


32-Band Interpolation FIR 


LFE Interpolation FIR 


Perfect Reconstruction 


Non-Perfect Reconstruction 


64 x Interpolation 


128 x Interpolation 


A Q1 


-1 ,1 19829700e-06 


A f~\~7 C) A ~7H ~~7 A ~7 ^ A A 

2,0781 71 747e-06 


6,2221 30032e-04 


A f^i A A ^ AO AAA^ AO 

1 ,244423330e-03 




-1 ,052683900e-06 


A OCAAO^AAH _ A A 

1 ,859034001 e-06 


5,721 592461 e-04 


A H d ylAH AH AA« AO 

1 ,144316160e-03 


/I QQ. 


-9,544782800e-07 


A aah A rr\"70A^ aa 

1 ,661 1 59786e-06 


5,252459669e-04 


A ACAdOAOdAn AO 

1 ,050489840e-03 


/I Q/1 


-8,3486061 00e-07 


1 ,482681 1 14e-06 


>J (H A77AH AAn A ^ 

4,81 27761 69e-04 


A AACCH /( AA<» A A 

9,625531 400e-04 


/I Q£ 

4yo 


—j a A a— i a f~ A A> A ^. A ~7 

-7,034745600e-07 


a oa^^aah a a 

1 ,321 84061 4e-06 


4,401 875485e-04 


A OA07AAAAA« A A 

8,803732300e-04 


4yo 


r~ c A A H A a "7 A A — . A^ 

5,5861 46700e-07 


A A ^AAAAAA A AA 

1 ,176999604e-06 


A A H A A A H A^ f\ f\ A 

4,01 8281 470e-04 


A AAAr il AAAA. A A 

8,036546600e-04 


a on 
4y / 


A j A A ~7 A H A A A A^ 

6,1 93701 600e-07 


1 ,046637067e-06 


O ACOAAHHAH^ A A 

3,658991 191 e-04 


—f AH 7AA7AAA« A A 

7,31 7967800e-04 


4yo 


a aahacaoaa^. a^ 

6,601 852300e-07 


a oao d h a a a .-^ a^ 

9,29341 6952e-07 


3,328395542e-04 


A A/"A77ArAA^ A A 

6,656776500e-04 


4yy 


6,770656500e-07 


A AOOH AdOCCn A^ 

8,2381 64355e-07 


O OH AOAAH ^ A A /I 

3,01 38631 70e-04 


A r\ A ~7~7 H ^ 4 AAn A/1 

6,02771 41 00e-04 


ouu 


0,67781 8900e-07 


—7 00070AylOCn A "7 

7,^oo7o94^5e-07 


2.Jdb \ 1 59^7e-04 


5,4522208u0e-04 




6,300390500e-07 


a /in /Ioat /i aa^ a^ 

6,434325428e-07 


A / AAAOH HAA~ A A 

2,460231 190e-04 


A AAAdCACAA« A/1 

4,920452500e-04 




5,697322600e-07 


C AAC A AAOCA^. A^ 

5,6651 20852e-07 


A AHHA>i^OHO« A A 

2,21 1 84731 3e-04 


/I /IAAAOCAAA« A A 

4,423685900e-04 


DUo 


./I *1 A O^O C ~7 A A _ A~7 

4,468735700e-07 


>t a^aa^aoh rr ^ a^ 

4,97227631 5e-07 


A AA-( 7AflAAA^ A a 

1 ,981738606e-04 


A AAOdAAH AA« A A 

3,9634691 00e-04 


0U4 


o, / 1 iiyob^UUe-U / 


4,o4/oDooooe-U/ 


i 7CQQOO'30'Jn C\A 

\ , /byy^oooe-u4 


o ,00380 /ouue-U4 


QUO 


5,7461 47600e-07 


3,784752209e-07 


1, 57595801 3e-04 


3,151909600e-04 


506 


2,771 560000e-07 


3,2766891 42e-07 


1 ,397485757e-04 


2,794966000e-04 


507 


9,130198800e-08 


2,81 81 4351 4e-07 


1, 233371 440e-04 


2,466738200e-04 


508 


2,529296600e-08 


2,404238444e-07 


1,082170274e-04 


2,164336300e-04 


509 


8,358679600e-09 


2,030677564e-07 


9,43932391 2e-05 


1 ,887860900e-04 


510 


-7,1 387421 00e-11 


1 ,693738625e-07 


8,179365250e-05 


1, 6358691 00e-04 


511 


-1,140033200e-10 


1,3901 91 784e-07 


2,658434387e-04 


5,31 68571 00e-04 



D.9 1 024 tap FIR for X96 Synthesis QMF 



Coef # 


Coefficient Value 


1 


-7,1 279389866041 690e-8 


2 


-7,09509031 50874990e-8 


3 


-7,9525034321 375090e-8 


4 


-8,9326871 281 374790e-8 


5 


-9,74831 90494874640e-8 


6 


-1,058665951 0721 950e-7 


7 


-1,1 52736341 1865760e-7 


8 


-1,25231 0964561 5350e-7 


9 


-1,35556361 37971 380e-7 


10 


-1,4640931 8211 91 640e-7 


11 


-1,57874249981 2371 Oe-7 


12 


-1 ,6994592926571 220e-7 


13 


-1,82641 8431 101 81 50e-7 


14 


-1 ,9600594587757850e-7 


15 


-2,1007899706006800e-7 


16 


-2,2489759047553620e-7 


17 


-2,4050245877923400e-7 


18 


-2,569387461 5777660e-7 


19 


-2,7425370308549580e-7 


L 20 


-2,9249662541 757600e-7 


21 


-3,1171981968268900e-7 


22 


-3,31 978640404731 80e-7 


23 


-3,53331 34588843600e-7 


24 


-3,758391 046541 21 80e-7 


25 


-3,99566281 21 6691 90e-7 


26 


-4,24580452961 59330e-7 


27 


-4,5095249563698380e-7 


28 


-4,78756591 12606760e-7 


29 


-5,0807031 294631 050e-7 


30 


-5,3897464223335550e-7 


31 


-5,7155423388793000e-7 


32 


-6,05897411 9201 9800e-7 


33 


-6,4209580087853050e-7 


34 


-6,8024432849991 400e-7 


35 


-7,20441 6525901 3450e-7 



Coef # 


Coefficient Value 


36 


-7,627901 81 72077200e-7 


37 


-8,073961 0368280600e-7 


38 


-8,5436937020293650e-7 


39 


-9,0382333224895500e-7 


40 


-9,5587472540498280e-7 


41 


-1,01 064371 2821 3760e-6 


42 


-1, 06825389061 46330e-6 


43 


-1,1 2883236371 55430e-6 


44 


-1,1 9250971 846461 00e-6 


45 


-1 ,25941 93438633260e-6 


46 


-1 ,3296974052300520e-6 


47 


-1,4034829433780260e-6 


48 


-1, 48091 78634979940e-6 


49 


-1, 5621 46591 3980060e-6 


50 


-1, 64731 60377545360e-6 


51 


-1,7365751 553971 340e-6 


52 


-1,83007491 045261 20e-6 


53 


-1 ,9279680902322050e-6 


54 


-2,0304092660990760e-6 


55 


-2,1 375541 686662320e-6 


56 


-2,2495596253776460e-6 


57 


-2,36658281 20950560e-6 


58 


-2,488781 1970856540e-6 


59 


-2,61 631 20571 644820e-6 


60 


-2,7493324266391 140e-6 


61 


-2,8879984579501 380e-6 


62 


-3,0324653639203880e-6 


63 


-3,1828867865974640e-6 


64 


-3,33941 47240855820e-6 


65 


-3,5021 985520037540e-6 


66 


-3,671 3849262294680e-6 


67 


-3,8471 16621 801 5830e-6 


68 


-4,0295324657699800e-6 


69 


-4,2187670209852250e-6 


70 


-4,41 4950504331 2320e-6 



Coef # 


Coefficient Value 


71 


-4,6182073040543900e-6 


72 


-4,8286558658894680e-6 


73 


-5,0464076705003400e-6 


74 


-5,271 5671 455405850e-6 


75 


-5,50423075354581 00e-6 


76 


-5,7444868785028750e-6 


77 


-5,99241 42424804800e-6 


78 


-6,248081 7863072800e-6 


79 


-6,5115476278604000e-6 


80 


-6,7828589565804950e-6 


81 


-7,06205075451 541 00e-6 


82 


-7,3491 4566971 44900e-6 


83 


-7,6441 525098592000e-6 


84 


-7,9470661 364365490e-6 


85 


-8,2578666429227760e-6 


86 


-8,57651 921 547681 00e-6 


87 


-8,902971 89001 22000e-6 


88 


-9,2371 5541 95468400e-6 


89 


-9,57898259882531 00e-6 


90 


-9,9283481 667264540e-6 


91 


-1, 02851 27334608450e-5 


92 


-1, 06491 75634328590e-5 


93 


-1,1020327046334650e-5 


94 


-1,1 398393880881 060e-5 


95 


-1,1 7831 66036693560e-5 


96 


-1,21 7441 0889256530e-5 


97 


-1,2571 871 574469220e-5 


98 


-1 ,297526685803031 Oe-5 


99 


-1,3384289978242060e-5 


100 


-1,3798608540965960e-5 


101 


-1,4217863365843780e-5 


102 


-1,4641668383496450e-5 


103 


-1,506960952767971 Oe-5 


104 


-1,5501244597782600e-5 


105 


-1,59361 01 341 640500e-5 
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D.10 VQ Tables 



D.10.1 ADPCM Coefficients 

Each vector consists of four elements and the Codebook has 2 12 = 4096 vectors. Each entry represents an element 
multiplied by 2 13 . Therefore, the actual value of each element is calculated as follows: 

Entry 

ActualElerrentValue = — — — 
2 13 



For example, the first entry in the table gives: 

9928 



= 1.2119140625 



Due to its extensive size, this table is not included here. 



D.10.2 High Frequency Subbands 

Each vector consists of 32 elements and the Codebook has 2 10 = 1024 vectors. Each entry is 16 bits, representing two 
vector elements, so it takes 16 entries in the table to represent one vector of 32 elements. Each entry is first be split into 
two 8-bit integers and then each divided by 24 to give two vector elements. 

Due to its extensive size, this table is not included here. 



D.1 1 Look-up Table for Downmix Scale Factors 



DmixTbllndex 


LogAbsValues (dB) 


AbsValues 


DmixTable 


InvDmixTbllndex 


InvDmixTbl 





-60,0000 


0,001000 


33 


N/A 


N/A 


1 


-59,5000 


0,001059 


35 


N/A 


N/A 


2 


-59,0000 


0,001122 


37 


N/A 


N/A 


3 


-58,5000 


0,001189 


39 


N/A 


N/A 


4 


-58,0000 


0,001259 


41 


N/A 


N/A 


5 


-57,5000 


0,001334 


44 


N/A 


N/A 


6 


-57,0000 


0,001413 


46 


N/A 


N/A 


7 
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0,001496 


49 


N/A 


N/A 
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0,001585 


52 


N/A 


N/A 
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0,001679 


55 


N/A 


N/A 


10 


-55,0000 


0,001778 


58 


N/A 


N/A 


11 
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0,001884 


62 


N/A 


N/A 


12 


-54,0000 


0,001995 


65 


N/A 


N/A 


13 
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0,002113 


69 


N/A 


N/A 


14 
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0,002239 


73 


N/A 


N/A 


15 
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N/A 
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16 


-52,0000 


0,002512 


82 
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N/A 


19 


-50,5000 


0,002985 


98 


N/A 
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N/A 


23 


-48,5000 


0,003758 


123 


N/A 


N/A 


24 


-48,0000 


0,003981 


130 


N/A 


N/A 
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N/A 


N/A 
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29 
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N/A 


N/A 
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Annex E (informative): 

DTS and DTS-HD formats in ISO Media Files 

This annex specifies how all DTS audio formats are used in media files based on the ISO/IEC 14496-12 [i.l] 
specification. As there are many variations of file formats based on this family of definitions, the present document will 
refer to them generically as ISO media files. 

In general, the system layer definition described in ISO/IEC 14496-1 [i.4] is used to embed the audio. All DTS audio 
formats comply with the conventions described herein. 

A more current version of the definitions described in this annex may be available from DTS at www.dts.com . 



E.1 ISO Defined Variables 

In these clauses, operational rules for boxes defined in ISO/IEC 14496-12 [i.l] and ISO/IEC 14496-14 [i.3] as well as 
definitions of private extensions to those ISO file format standards are described. 

This annex is informative in that it is not required to package the audio formats defined here in ISO media files. 
However, if any of the audio formats specified in the present document are stored in an ISO media file, this annex is to 
be followed. 

An ISO media file may contain one or more audio tracks. The tracks are composed in conformity to ISO base media file 
format and described in ISO/IEC 14496-12 [i.l], for an audio track structure and therefore uses the following: 

E.1 .1 Track Header Box 

The syntax and values for the Track Box and its sub-boxes conform to section 8.5 of ISO/IEC 14496-12 [i.l] and the 
following fields of each box are set to the following specified values: 

flags = 000007h, except for the case where the track belongs to an alternate group; 

layer = 0; 

volume = OlOOh; 

matrix = {00010000h,0,0,0, 00010000h,0,0,0, 40000000h}; 
width = 0; 
height = 0. 

E.1. 2 Sync Sample Box 

As all audio access units are random access points (sync samples), the Sync Sample Box is not present in the track time 
structure of any audio track within an ISO media file. 

E.1 .3 Handler Reference Box 

The syntax and values for the Handler Reference Box conforms to section 8.9 of ISO/IEC 14496-12 [i.l] and the fields 
of this box are set to the following specified values: 

handler_type = 'soun' 
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E.1 .4 Sound Media Header Box 

The syntax and values for the Sound Media Header box conform to section 8.11.3 of ISO/IEC 14496-12 [i.l] and the 
fields of this box are set to the following specified values: 

balance = 



E.2 Storage of DTS elementary streams 

This clause describes how to store DTS elementary streams within an ISO media file. An audio sample consists of a 
single DTS or DTS-HD synchronized audio frame, as defined in the present document. Audio samples are always 
stored in the order of playback for a given track. 



E.3 SampleDescription Box 

The DTS SampleEntry box is derived from the AudioSampleEntry box defined in ISO/IEC 14496-12 [i.l]. The 
dts-specific SampleEntry box is identified by a unique codingname value (see Table E-l). Table E-2 provides additional 
information regarding which codingname is appropriate for a particular elementary stream. 



Table E-1 : Defined Audio Formats 



codingname 


Description 


dtsc 


DTS formats prior to DTS-HD 


dtsh 


DTS-HD audio formats 


dtsl 


DTS-HD Lossless formats 


dtse 


DTS Low Bit Rate (LBR) formats 



E.3.1 DTS_SampleEntry 

DTS_SampleEntry extends the AudioSampleEntry box defined in ISO/IEC 14496-12 [i.l]: 

Class DTS_SampleEntry ( ) extends AudioSampleEntry (codingname) { 

DTSSpecif icBox ( ) // 'ddts' box 

} 

For DTS_SampleEntry(), the following values inherited from AudioSampleEntry are set as follows: 
codingname is according to table E-l. 

channelcount is set to the number of decodable output channels in basic playback, as described in the 'ddts' 
configuration box. Additional channel count as a result of future feature enhancements are defined in a box following 
the 'ddts' box, where ReservedBox() is the placeholder. 

samplesize is always set to 16. 

samplerate is set according to DTSSamplingFrequency of either: 

• 48 000 for original sampling frequencies of 24 000 Hz, 48 000 Hz, 96 000 Hz or 192 000 Hz; 

• 44 100 for original sampling frequencies of 22 050 Hz, 44 100 Hz, 88 200 Hz or 176 400 Hz; 

• 32 000 for original sampling frequencies of 16 000 Hz, 32 000 Hz, 64 000 Hz or 128 000 Hz. 
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E.3.2 AudioSampleEntry Box for DTS Formats 

The syntax and values of the AudioSampleEntry Box conforms to DTSSampleEntry. 

The DTS_SampleEntry follows the syntax of the AudioSampleEntry defined in ISO/IEC 14496-12 [i.l]. 

The configuration of the DTS elementary stream is described in the DTSSpecificBox ('ddts'), within DTSSampleEntry. 
The syntax and semantics of the DTSSpecificBox are defined in clause E.3.3. 

E.3.3 DTSSpecificBox 

The syntax and semantics of the DTSSpecificBox are shown below: 

class DTSSpecificBox { 



unsigned 


int (32) 


size ; 






unsigned 


char 


type [4] = 'ddts'; 






unsigned 


int (32) 


DTSSamplingFrequency; 






unsigned 


int (32) 


maxBitrate ; 






unsigned 


int (32) 


avgBitrate ; 






unsigned 


char 


pcmSampleDepth; 


// 


value is 16 or 24 bits 


bit (2) 




FrameDurat ion ; 


// 


= 512, 1 = 1024, 2 = 2048, 3 = 4096 


bit (5) 




S treamConst ruction ; 


// 


Table E-2 


bit(l) 




CoreLFEPresent ; 


// 


= none; 1 = LFE exists 


bit (6) 




CoreLayout ; 


// 


Table E-3 


bit (14) 




CoreSize ; 






bit (1) 




StereoDownmix 


// 


= none; 1 = embedded downmix present 


bit (3) 




RepresentationType ; 


// 


Table E-4 


bit (16) 




ChannelLayout ; 


// 


Table E-5 


bit(l) 




MultiAs set Flag 


// 


= single asset, 1 = multiple asset 


bit(l) 




LBRDurat ionMod 


// 


= ignore, 1 = Special LBR duration modifier 


bit(l) 




ReservedBoxPresent 


// 


= NoReservedBox, 1 = NoReservedBox present 


bit (5) 




Reserved 


// 


Reserved bits are set to 


ReservedBox ( ) 




// 


optional, for future expansion 



}; 

E.3.3. 1 Semantics 

DTSSamplingFrequency: The maximum sampling frequency stored in the compressed audio stream. 
pcmSampleDepth: The bit depth of the rendered audio. For DTS formats this is usually 24-bits. 

maxBitrate: The peak bit rate, in bits per second, of the audio elementary stream for the duration of the track, including 
the core substream (if present) and all extension substreams. If the stream is a constant bit rate, this parameter has the 
same value as avgBitrate. If the maximum bit rate is unknown, this parameter is set to 0. 

avgBitrate: The average bit rate, in bits per second, of the audio elementary stream for the duration of the track, 
including the core substream and any extension substream that may be present. 

FrameDuration: This code represents the number of audio samples decoded in a complete audio access unit at 
DTSSampling Frequency. 

StreamConstructon: Provides complete information on the existence and of location of extensions in any 
synchronized frame. See Table E-2. For any stream type not listed in Table E-2, this parameter is set to and the coding 
name defaults to dtsh. 
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Table E-2: StreamConstruction 



oireamoonsir ucnon 


Core substream 


Extension substream 


couingname 


Core 


XCH 


X96 


XXCH 


Core 


XXCH 


X96 


XBR 


XLL 


LBR 


1 


■/ 




















dtsc 


2 




■/ 


















dtsc 


3 


■/ 






■/ 














dtsh 


4 


■/ 




■/ 
















dtsc 


5 


■/ 










■/ 










dtsh 


6 


■/ 














■/ 






dtsh 


7 


■/ 


■/ 












■/ 






dtsh 


8 


■/ 






■/ 








■/ 






dtsh 


9 


■/ 










■/ 




■/ 






dtsh 


10 j 


■/ 












■/ 








dtsh 


11 




■/ 










■/ 








dtsh 


12 


■/ 






■/ 






■/ 








dtsh 


13 


■/ 










■/ 


■/ 








dtsh 


14 


■/ 
















■/ 




dtsh 


15 


■/ 


■/ 


















dtsh 


16 


■/ 




■/ 
















dtsh 


17 






















dtsl 


18 




















■/ 


dtse 


19 










■/ 












dtsh 


20 










■/ 


■/ 










dtsh 


21 










■/ 








■/ 




dtsh 



CoreLFEPresent: Indicates the presence of an LFE channel in the core. If no core substream exists, this value is 
ignored. 

CoreLayout: This parameter represents the channel layout of the core within the core substream and is set according to 
Table E-3. If no core substream exists, this parameter is ignored and ChannelLayout or RepresentationType is used to 
determine channel configuration. 



Table E-3: CoreLayout 



Core Layout 


Description 





Mono (1/0) 


2 


Stereo (2/0) 


4 


LT,RT (2/0) 


5 


L, C, R (3/0) 


6 


L, R, S (2/1) 


7 


L, C, R, S (3/1) 


8 


L, R, LS, RS (2/2) 


9 


L, C, R, LS, RS (3/2) 


31 


use ChannelLayout 



For streams where StreamConstruction is undefined (i.e. StreamConstruction = 0), or a DTS core component only exists 
in the extension substream (e.g. StreamConstruction = 19, 20 or 21 in ), CoreLayout is set to 31. 

All undefined values for CoreLayout are reserved for future use. 

CoreSize: The size of a core substream AU in bytes. If no core substream exists CoreSize=0 and parameters 
CoreLayout and CoreLFEPresent are ignored. 

StereoDownmix: Indicates the presence of an embedded stereo downmix in the stream This parameter is not valid for 
stereo or mono streams. 

RepresentationType: This indicates special properties of the audio presentation, as indicated in Table E-4. This 
parameter is only valid when all flags in ChannelLayout are set to 0. If ChannelLayout t- 0, this value is ignored. 
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Table E-4: RepresentationType 



RepresentationType 


Description 





Audio asset designated for mixing with another audio asset 


2 


Lt/Rt Encoded for matrix surround decoding; it implies that total number of encoded 
channels is 2 


3 


Audio processed for headphone playback; it implies that total number of encoded 
channels is 2 


1 and 4 through 7 


Reserved 



ChannelLayout: Provides complete information on channels coded in the audio stream including core and extensions. 
The binary masks of the channels present, as shown in Table E-5, are added together to create ChannelLayout. 



Table E-5: ChannelLayout 



Bit Masks 


Loudspeaker Location Description 


Number of Channels 


0001 h 


Centre in front of listener 


1 


0002h 


Left/Right in front 


2 


0004h 


Left/Right surround on side in rear 


2 


0008h 


Low frequency effects subwoofer 


1 


001 Oh 


Centre surround in rear 


1 


0020h 


Left/Right height in front 


2 


0040h 


Left/Right surround in rear 


2 


0080h 


Centre Height in front 


1 


0100h 


Over the listener's head 


1 


0200h 


Between left/right and centre in front 


2 


0400h 


Left/Right on side in front 


2 


0800h 


Left/Right surround on side 


2 


1000h 


Second low frequency effects subwoofer 


1 


2000h 


Left/Right height on side 


2 


4000h 


Centre height in rear 


1 


8000h 


Left/Right height in rear 


2 



MultiAssetFlag: This flag is set if the stream contains more than one asset. This also implies that a DTS extension 
substream is present. Multiple asset streams use the 'dtsh' coding type. When multiple assets exist, the remaining 
parameters in the DTSSpecificBox only reflect the coding parameters of the first asset. 

LBRDurationMod: This flag indicates a special case of the LBR coding bandwidth, resulting in 1/3 or 2/3 band 
limiting. The result of this is the LBR frame duration is 50 % larger than indicated in FrameDuration. For example, 
when this flag is set to 1, the FrameDuration is 6 144 samples instead of 4 096 samples. 

Reserved: These bits are reserved for future definition. ISO media files created according to this version of 
specification will have these bits set to 0. 

E.3.4 ReservedBox 

The reserved box is optional and serves as a placeholder for future expansion. Additional private boxes may follow the 
'ddts' box in the DTS_SampleEntry(). Playback devices not equipped to support these additional extensions depend on 
the 'ddts' box for basic playback capability. 
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E.4 Restrictions on DTS Formats 

This clause describes the restrictions that apply to the DTS formats encapsulated in an ISO media file. 

E.4.1 General constraints 

The following conditions remain constant in a core substream: 

Duration of Synchronized Frame 

Sampling Frequency 

Audio Channel Arrangement 

Low Frequency Effects flag 

Extension assignment 
The following conditions remain constant in an Extension substream: 

Duration of Synchronized Frame 

Sampling Frequency 

Audio Channel Arrangement including LFE 
Embedded stereo flag 

Extensions assignment defined in StreamConstruction 

E.5 Implementation of DTS Sample Entry 

The information needed to derive the elements of the DTS Sample Entry box and boxes contained within it, may be 
extracted from the respective elementary stream. DTS has tools available to implementers that will analyze DTS 
elementary streams and extract the information necessary to populate these parameters. DTS document #9302J81100, 
describes the function calls and return structures. To obtain this tool and additional documentation, please direct all 
document requests to DTS Licensing at Licensing Administration @ dts .com . 
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Annex F (informative): 

Application of DTS formats to MPEG-2 Streams 

This annex specifies how DTS and DTS-HD audio is applied in MPEG-2 systems and provides additional information 
and references to the usage of DTS and DTS-HD elementary streams in DVB broadcast applications. While the use of 
DTS formats in DVB broadcast is optional, if they are used, this specification should be followed. 

This Annex is informative in that it is not required to use DTS or DTS-HD audio in an MPEG-2 system. However, if 
the audio formats specified in the present document are implemented in MPEG-2 systems, this annex is to be followed. 

Additional information pertaining to DTS formats in other MPEG-2 TS environments may be available at 
www.dts.com. 



F.1 Buffering Model 

The DTS buffering model is designed in accordance with ISO/IEC 13818-1 [i.2]. Refer to the derivation of BS n for 
audio elementary streams. 

• For DTS core streams, the main audio buffer size (BS n ) has a fixed value of 9 088 bytes, with a drain rate 
(Rx n ) of 2 Mbps. The fixed value above (9 088 bytes) was calculated from a double buffer (2x4 096 bytes) 
plus jitter (384 bytes) + packet bursts (512 bytes). 

• For DTS-HD Lossless formats, the value of BS n has a fixed value of 66 432 bytes, with an Rx n value of 
32 Mbps. 

• For all other DTS-HD formats, the value of BS n has a fixed value of 17 814 bytes, with an Rx n value of 
8 Mbps. 

F.2 Signalling 

F.2.1 PSI Signalling in the PMT 

Two related generations of DTS formats exist, the original DTS core format and the expanded DTS-HD format. As a 
result of this second generation of DTS formats, a new DTS-HD audio descriptor was created to accommodate the 
expanded feature set. This new structures can accommodate core only formats as well as extension only and 
core + extension combinations. If an MPEG-2 system supports DTS-HD, all DTS formats broadcast in that system may 
use the DTS-HD signalling as described in clause F.2. 1.2.2. 

F.2. 1.1 Stream Type 

All DTS and DTS-HD elementary streams are signalled as private_stream_l and therefore use a 

stream_type = 0x06, consistent with TS 101 154 [2], clause 4.1.6.1 and in accordance with ITU-T Recommendation 

H.222.0/ISO/IEC 13818-1 [i.2]. 



ETSI 



270 



ETSI TS 102 114 V1. 4.1 (2012-09) 



F.3 Elementary Stream Encapsulation 
F.3.1 Stream ID 

All DTS and DTS-HD elementary streams use a stream_id = OxBD, indicating private stream 1, in accordance with 
ITU-T Recommendation H.222.0/ISO/IEC 13818-1 [i.2]. Multiple DTS/DTS-HD streams may share the same value of 
stream_id since each stream is carried with a unique PID value. The mapping of values of PID to stream_type is 
indicated in the transport stream PMT. 

F.3.2 Calculation of PTS from the elementary stream 

The time duration of one audio access unit can be calculated by dividing the audio frame duration in samples by the 
audio sampling frequency. 

F.3.2. 1 Frame Duration from Core Substream Metadata 

In the case of a core substream, the audio frame duration for a normal (non-termination) frame is NBLKS, (described in 
clause 5.3.1) and the audio sampling frequency can be determined from SFREQ which is described in Table 5-5. Thus 
the frame duration in seconds is: 

frame duration( seconds) = (NBLKS + 1) x 32/Audio Sampling Frequency 

F.3. 2. 2 Frame Duration from Extension Substream Metadata 

The parameters nuRefClockCode and nuExSSFrameDurationCode are used to determine the audio frame duration when 
an extension substream is present. 

The frame duration is expressed by the number of clock cycles using the reference clock indicated by the value in 
RefClockPeriod (Table 7-3). The number of clock cycles is derived from nuExSSFrameDurationCode (clause 7.4.1) in 
the following manner: 

frame duration(seconds) = nuExSSFrameDurationCode x 572 X RefClockPeriod 

F.3.3 Audio Access Unit Alignment in the PES packet 

A valid sync word is aligned with the start of the PES packet data area. Valid DTS sync words are listed in Table F-l. 
Data_Alignment_Indicator in the PES packet header will indicate sync word alignment. 



Table F-1 : DTS-HD Sync Words 



name 


sync word 


description 


DTS SYNCWORD CORE 


0x7ffe8001 


core substream 


DTS SYNCWORD SUBSTREAM 


0x64582025 


extension substream 



When a core substream is present, DTS_SYNCWORD_CORE is aligned to the beginning of the PES payload. When 
only an extension substream is present, DTS_SYNCWORD_SUBSTREAM is aligned to the beginning of the PES 
payload. 

A PES packet of DTS audio will contain at least one complete audio access unit. Multiple complete access units are 
permitted in a PES packet only when the Elementary Stream consists of a single substream. 

The DTS core substream header parameter FSIZE, (clause 5.3.1), indicates the number of bytes in a core frame as 
FSIZE + 1 and the DTS-HD extension substream header parameter nuExtSSFsize, (clause 7.4.1), indicate the number of 
bytes in each extension substream frame (respectively), as shown in Figure F-l. The total PES packet payload of a 
single audio access unit will be the sum of these values for all substreams that are present. 
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If multiple substreams are present, the access units maintain an interleaved order of presentation, as illustrated in 
Figure F-l. 
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access unit 



Figure F-1 : PES packet payload 



F.4 Implementation of DTS and DTS-HD Audio Stream 
Descriptors 

The information needed to derive the elements within the audio descriptors can be derived from the respective 
elementary stream. DTS has tools available to implementers that will analyze DTS elementary streams and extract the 
information necessary to populate these parameters. DTS document #9302J81 100 [i.7], describes the function calls and 
return structures. To obtain this tool and additional documentation, please direct all document requests to DTS 
Licensing at LicensingAdministration@dts.com . 
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